Economics of the Firm

Economics of the Firm focuses on the economic issues and problems related to business organization, management, and strategy. Issues and problems include: an explanation of why corporate firms emerge and exist; why they expand: horizontally, vertically and spacially; the role of entrepreneurs and entrepreneurship; the significance of organizational structure; the relationship of firms with employees, providers of capital, customers, and government; and interactions between firms and the business environment.

[title text=”Main contents” link_text=”See more from basic to advanced” link=”/category/economics-of-business-firm/”]

[blog_posts style=”normal” col_spacing=”xsmall” columns=”3″ columns__md=”1″ depth_hover=”2″ auto_slide=”6000″ cat=”123″ posts=”3″ offset=”45″ show_date=”false” excerpt_length=”25″ comments=”false” image_height=”60%” image_size=”original” image_hover=”zoom” text_align=”left”]

[blog_posts style=”normal” col_spacing=”xsmall” columns=”3″ columns__md=”1″ depth_hover=”2″ auto_slide=”6000″ cat=”123″ posts=”3″ offset=”42″ show_date=”false” excerpt_length=”25″ comments=”false” image_height=”60%” image_size=”original” image_hover=”zoom” text_align=”left”]

[blog_posts style=”normal” col_spacing=”xsmall” columns=”3″ columns__md=”1″ depth_hover=”2″ auto_slide=”6000″ cat=”123″ posts=”3″ offset=”39″ show_date=”false” excerpt_length=”25″ comments=”false” image_height=”60%” image_size=”original” image_hover=”zoom” text_align=”left”]

[blog_posts style=”normal” col_spacing=”xsmall” columns=”3″ columns__md=”1″ depth_hover=”2″ auto_slide=”6000″ cat=”123″ posts=”3″ offset=”36″ show_date=”false” excerpt_length=”25″ comments=”false” image_height=”60%” image_size=”original” image_hover=”zoom” text_align=”left”]

[blog_posts style=”normal” col_spacing=”xsmall” columns=”3″ columns__md=”1″ depth_hover=”2″ auto_slide=”6000″ cat=”123″ posts=”3″ offset=”33″ show_date=”false” excerpt_length=”25″ comments=”false” image_height=”60%” image_size=”original” image_hover=”zoom” text_align=”left”]

[blog_posts style=”normal” col_spacing=”xsmall” columns=”3″ columns__md=”1″ depth_hover=”2″ auto_slide=”5000″ cat=”123″ posts=”3″ offset=”30″ show_date=”false” excerpt_length=”25″ comments=”false” image_height=”60%” image_size=”original” image_hover=”zoom” text_align=”left”]

[blog_posts style=”normal” col_spacing=”xsmall” columns=”3″ columns__md=”1″ depth_hover=”2″ auto_slide=”5000″ cat=”123″ posts=”3″ offset=”27″ show_date=”false” excerpt_length=”25″ comments=”false” image_height=”60%” image_size=”original” image_hover=”zoom” text_align=”left”]

[blog_posts style=”normal” col_spacing=”xsmall” columns=”3″ columns__md=”1″ depth_hover=”2″ auto_slide=”5000″ cat=”123″ posts=”6″ offset=”21″ show_date=”false” excerpt_length=”25″ comments=”false” image_height=”60%” image_size=”original” image_hover=”zoom” text_align=”left”]

[blog_posts style=”normal” col_spacing=”xsmall” columns=”3″ columns__md=”1″ depth_hover=”2″ auto_slide=”6000″ cat=”123″ posts=”6″ offset=”15″ show_date=”false” excerpt_length=”25″ comments=”false” image_height=”60%” image_size=”original” image_hover=”zoom” text_align=”left”]

[blog_posts style=”normal” col_spacing=”xsmall” columns=”3″ columns__md=”1″ depth_hover=”2″ auto_slide=”7000″ cat=”123″ posts=”6″ offset=”9″ show_date=”false” excerpt_length=”25″ comments=”false” image_height=”60%” image_size=”original” image_hover=”zoom” text_align=”left”]

[blog_posts style=”normal” col_spacing=”xsmall” columns=”3″ columns__md=”1″ depth_hover=”2″ auto_slide=”8000″ cat=”123″ posts=”9″ offset=”0″ show_date=”false” excerpt_length=”25″ comments=”false” image_height=”60%” image_size=”original” image_hover=”zoom” text_align=”left”]

[row style=”small” class=”form-lien-he”]

[col span=”2″ span__sm=”12″]

[/col]
[col span=”4″ span__sm=”12″]

[button text=”Home” color=”secondary” style=”gloss” radius=”5″ depth=”2″ depth_hover=”3″ expand=”true” icon=”icon-star” icon_pos=”left” link=”https://phantran.net/”]

[/col]
[col span=”4″ span__sm=”12″]

[button text=”See basic to advanced” style=”gloss” radius=”5″ depth=”2″ depth_hover=”3″ expand=”true” icon=”icon-checkmark” icon_pos=”left” link=”/category/economics-of-business-firm/”]

[/col]
[col span=”2″ span__sm=”12″]

[/col]

[/row]

Business Management: nature and scope

Business management (also known as business administration) is the administration of a commercial enterprise. It includes all aspects of overseeing and supervising business operations. From the point of view of management and leadership, it also covers fields that include office building administration, accounting, finance, designing, development, quality assurance, data analysis, sales, project management, information-technology management, research and development, and marketing.

The administration of a business includes the performance or management of business operations and decision-making, as well as the efficient organization of people and other resources to direct activities towards common goals and objectives. In general, “administration” refers to the broader management function, including the associated finance, personnel and MIS services.

Administration can refer to the bureaucratic or operational performance of routine office tasks, usually internally oriented and reactive rather than proactive. Administrators, broadly speaking, engage in a common set of functions to meet an organization’s goals. Henri Fayol (1841-1925) described these “functions” of the administrator as “the five elements of administration”.  According to Fayol, the five functions of management are;

  1. Planning.
  2. Organizing.
  3. Commanding.
  4. Coordinating.
  5. Controlling

Sometimes creating output, which includes all of the processes that generate the product that the business sells, is added as a sixth element.

Alternatively, some analyses view management as a subset of administration, specifically associated with the technical and operational aspects of an organization, and distinct from executive or strategic functions.

The gains from the trade of the firm

1. PRODUCTION AND THE FIRM

The firm is not an easy economic concept to define. Everyone accepts that IBM or ICI or Ford constitute ‘firms’, but from an economic as distinct from a purely legal point of view it is necessary to discover what underly­ing principles enable us to refer to such international giants using the same word as might be used for the local grocer’s retail outlet. Further, if the local grocer’s shop is a ‘firm’ would the same be true of a small hospital run by a charitable foundation, or a church, or even a family? Established text­books on the principles of economics typically reveal little curiosity about this issue. The firm is simply the fundamental microeconomic unit in the theory of supply. Firms exist and can be recognised by their function, which is to transform inputs of factors of production into outputs of goods and services. With some notable exceptions, the implied asymmetry between the theory of demand, with its emphasis on the individual consumer as the ulti­mate microeconomic building block, and the theory of supply, with its emphasis on the firm, is rarely explored.

Conventional theory does, however, provide a clue to the nature of ‘the firm’. The process of production usually involves coordinating the activi­ties of different individuals. Suppliers of labour, capital, intermediate inputs, raw materials and land cooperate with one another to produce outputs of goods and services. The institutional setting in which this coor­dination of activity is attempted may vary enormously, but where economic agents cooperate with one another not through a system of explicit con­tracts that bind each to every other member of the group but through a system of bilateral contracts in which each comes to an agreement with a ‘single contractual agent’, the essential ingredient of ‘the firm’ is present. It is therefore the nature of the contractual arrangements that bind individu­als together which, at least from the point of view of economic theory, con­stitutes the central preoccupation of the theory of the firm. Much of this book will be concerned to elaborate upon this basic idea and to investigate the insights that flow from it.

2. SCARCITY

Most expositions of elementary economic analysis start with a statement to the effect that economics is concerned with choice. If individuals are con­fronted by limited resources they must choose between alternatives. Following the definition of Robbins (1935, p. 16), ‘Economics is the science which studies human behaviour as a relationship between ends and scarce means which have alternative uses.’ As Robbins recognised, this definition is not of very great interest when applied to isolated individuals. A lone individual would have an allocation problem to solve, but a student of such a person’s activities would find it difficult to go further than asserting the proposition that out of all the perceived available courses of action, the iso­lated decision-maker chooses the alternative that he or she most prefers.

As part of a community of individuals, however, individuals are con­fronted with a more complex problem. They will usually find that their best strategy is not to cut themselves off from all communication with their fellows, but rather to coordinate their activity with that of other people. Making the best use of scarce resources will therefore involve forming agreements with others, and economics then becomes the study of the social mechanisms which facilitate such agreements. Hidden in this state­ment, however, are two rather different preoccupations.

First, it is possible to ask in any given situation what particular alloca­tion (or allocations) of resources, what set of agreements, would be best in the sense that no individual or set of individuals within the entire commu­nity could gain by opting out and substituting alternative feasible arrange­ments. Economists express this idea in the technical language of game theory as identifying allocations of resources which are in ‘the core’ of a market exchange game.3 Suppose there were a community of four individ­uals. Each, we may assume, could live the life of a recluse, but none wishes to do so if cooperation with the others is capable of adding to his or her perceived satisfaction. The four meet and discuss various proposals which will make them all better off. No one will accept a deal which reduces their well-being below that of an isolated recluse, and similarly no final agree­ment would hold if any two or possibly three out of four could benefit by coming to some alternative arrangement between themselves. A set of agreements which it is in no single individual’s or group’s interests to renounce in favour of an available alternative, is said to be in the ‘core’. Allocations of resources that are ‘core’ allocations represent in one sense a ‘solution’ to the resource allocation problem. Sections 3 to 6, below, present a brief description of this tradition in economics using a very simple example. Proceeding in this way, the unsuitability of the theoretical appa­ratus of general economic equilibrium for the investigation of economic organisation is thrown into sharp relief.

Economic organisation is more closely concerned with the process by which agreements are formulated. If the tastes and preferences of our four individuals are known to the economist; if their skills and endowments of resources including tools and equipment as well as natural resources, raw materials and land are likewise clearly defined; and if all the feasible options technically available from using these resources in various combi­nations can be listed, it should be possible in principle to work out all the mutually advantageous agreements which potentially exist. Working out allocations of resources which are in the ‘core’ becomes a matter of mere ‘calculation’. All the necessary information which is formally required to uncover a ‘solution’ is present, and the rest can be accomplished by a sufficiently devoted mathematician or adequately powerful computer. This, though, tells us nothing about the methods adopted by our four individu­als to solve their resource allocation problem.

Imagine, for example, that by some fluke of history the four people meet on an otherwise uninhabited island (the sole survivors from four separate shipwrecks). Each person will have little idea of the skills possessed by his or her associates. Indeed each may be in some doubt about his or her own capabilities in the new and unfamiliar environment. The potential of the island to sustain life, the characteristics and uses of the available resources, the best methods of using these resources for various purposes (making clothes, building shelter, finding food, and so on) are all a matter of guess­work and hunch. Setting up the problem in this way makes it clear that what our four individuals lack most is not a calculator, but information.

Facing the appalling problems of survival, the four islanders are likely to agree to cooperate with one another. These agreements will not repre­sent a ‘solution’ to the problem of resource allocation in any ideal sense, since no one can possibly know what the ideal way of proceeding entails. Instead, agreements between the four represent stages in a process of dis- covery.4 As time advances, experience will reveal something of the relative talents of the individuals and the properties and potential of the available resources. Arrangements between the individuals are continually modified in the light of past experience and of expectations about the future. In this framework it is still possible to argue that the subject matter of economics

is the allocation of scarce means between competing uses, but it is clear that the nature of the economic problem when opportunities are not fully known is quite different from the problem conceived of as making the best of available resources in the context of perfect information.

3. THE ALLOCATION PROBLEM

At this stage it will prove useful to develop the theme further by reference to a simple example of the sort frequently encountered in basic textbooks on the principles of economics. We continue to assume that the world con­sists of four individuals who possess differing endowments of resources. Conventional analysis then proceeds on the assumption that the limited resources available to each individual permit them to produce various known combinations of goods and services. Suppose for simplicity that people desire only two goods (x and y). With the resources at his or her dis­posal, person A can produce any combination of x and y in the area aa’0 illustrated in Figure 1.1. Given that both x and y confer benefits on person A, it is inconceivable that he or she would choose to produce at any point inside the line aa’. This line aa’ is called person A’s production possibility curve. Its downward slope reflects the fact that the production of more of any one good requires resources to be diverted from the production of the other. The steepness of the curve indicates the amount of y which has to be sacrificed to produce an extra unit of x. In the case of person A, one more unit of x entails the sacrifice of four units of y. Thus the slope of the pro­duction possibility curve indicates the marginal opportunity cost of an extra unit of x. If person A produces more x its marginal cost will be 4y. Note that the cost of x can be interpreted as a physical and objective measure (the amount of y forgone) only because we have assumed that all the options available to A are known to him or her with complete certainty.

Each of the other people (labelled B, C and D) will also face constraints on their ability to produce. The constraints are represented by the lines bb’, cc’ and dc’ in Figure 1.1. Notice that some individuals are luckier than others. All points on person B’s production possibility curve are unattain­able by any other person. Notice also that the marginal costs of production differ for each person. Person D, for example, is relatively poor but in terms of y sacrificed he or she is the cheapest producer of x.

As solitary individuals, each person will have to pick a point on his or her production possibility curve. Suppose, for example, that x and y were not substitutable in consumption and that everyone consumed these goods in fixed proportions (say equal quantities of each). In the absence of trade, consumption points would be given where production possibility curves intersect a 45° line through the origin. The total output of the community will be 8x and 8y. The individual consumption and production levels of each person are recorded in Table 1.1.

One of the most enduring discoveries in economic theory, first fully established by David Ricardo, suggests, however, that these four individu­als could do much better through specialisation and exchange. Consider the curve TT in Figure 1.1. This is referred to as the ‘community outer-bound production possibility curve’ or the ‘community transformation curve’. Given the production constraints facing each individual, it is easy to see that if all four people produced product x they could between them achieve 12.5 units of output. If now some y is to be produced, it will entail the sacrifice of some x and it seems reasonable to allocate the person to y pro­duction who can produce it at least cost. This person is A, for whom each unit of y will entail the sacrifice of only 0.25 units of x. Person A has the greatest ‘comparative advantage’ in y production of the four individuals. If A specialises in y production and persons B, C and D specialise in x pro­duction, the community in total will achieve an output of ten units of x and ten units of y (point A). Further y production can only be achieved by using another person in addition to A. The person who can produce further y at least marginal cost is now person B for whom the marginal cost of y is 0.33x. Complete specialisation of both A and B in y production and of C and D in x production would enable the community to achieve six units of x and 22 units of y (point B). Yet further y production must now involve person C, for whom the marginal cost is 0.5x, and so forth.

Given the rather extreme assumptions we have made about consumption patterns, it is clear that point A will represent the best production point. Specialisation has resulted in an increase of community output of two units of x and two of y.No alternative arrangements exist which would permit the achievement of any points further out along the 45° line. We would expect, therefore, that any agreement between the four individuals would involve A specialising in y production and B, C and D specialising in x production.

This still leaves open the question of how the benefits of specialisation are to be distributed. We might, for example, envisage one of the transactors making the following suggestion. ‘Since our joint efforts will result in a total increase in output of 2x and 2y above that achievable by our original unco­ordinated activity, let us each share equally in this benefit. Final consump­tion levels would then be as recorded in Table 1.2. All individuals achieve a consumption level 0.5 units higher than those recorded in Table 1.1.

This attempted solution will not work, however. To see why, it is neces­sary to consider all the trading options available to the various transactors. There is nothing to stop A and B, for example, from getting together and agreeing to collaborate without the others. Similarly, persons C and D might come to a separate agreement. The total output or ‘payoff’ achiev­able by all the conceivable ‘coalitions’ of people is recorded in Table 1.3.

The reader should take an entry at random and check that its meaning is clear.5 If persons C and D collaborated they could achieve a combined output of four units of x and four units of y, as is illustrated in Figure 1.2. D would specialise in x production thus yielding three units of x, while C would produce one unit of x and four units of y. In total, they therefore produce four units of each commodity.

Now consider the position of persons A and D. If they accept the deal offered in Table 1.2 they will join in an agreement which involves all four transactors with a total payoff to the ‘coalition’ of ten units of x and ten of y. Out of this, A will receive 2.5 units of each commodity and D will receive 1.5 units of each, but from Table 1.3 we see that simply by ignoring the others and striking a deal between themselves, A and D could receive a combined payoff of 4.4 units of each commodity instead of the 4.0 of Table 1.2. It follows that the ‘allocation’ of Table 1.2 is not in the ‘core’. Persons A and D could both be better off by renouncing the allocation of Table 1.2 and agreeing an alternative between themselves.

To illustrate the case of an allocation which is in the ‘core’, consider the entries of Table 1.4. Comparing the entries in Table 1.4 with those in Table 1.3, it will be confirmed that no coalition of individuals could do better by striking a separate bargain between themselves. An alliance of B, C and D, for example, could produce a payoff of 7.5, but their combined allocation in Table 1.4 is 7.6. A similar calculation can be performed for every other possible coalition. Thus the allocation of Table 1.4 is in the ‘core’ of the exchange game.

An agreement to specialise in accordance with comparative advantage and then to allocate the output as described in Table 1.4 is therefore one ‘solution’ to the economic problem of making the best out of scarce resources. It is not, however, a unique solution, as the reader can verify by checking the entries of Table 1.5 against those in Table 1.3. The three allo­cations recorded in Table 1.5 are also in the ‘core’.

4. RECONTRACTING AND THE ALLOCATION PROBLEM

In the section above, attention was focused primarily on calculating a ‘solu­tion’ to the allocation problem under certain specific conditions. Little was said explicitly about the mechanism by which a solution might be achieved. Specialisation implies the existence of a coordinating mechanism by which one person’s activities are made compatible with the actions of others. One mechanism consistent with the example of section 3 is a bargaining process. The four individuals could be seen as initially forming provisional agree­ments. If it then transpired that alternative more beneficial arrangements were possible for some individual or set of individuals (the provisional agreements did not represent a ‘core’ allocation) then the parties could ‘recontract’. The process of recontracting would continue until it was in no one’s interest to renounce the existing provisional agreement. At this point the agreement would be finalised.6 The provisional agreement summarised in Table 1.2, for example, was renounced by persons A and D. If at the end of further negotiations the agreement summarised in the first line of Table 1.5 were hit upon, this would hold and the process of recontracting would cease.

The recontracting process just described does present some awkward dilemmas for the theorist, however, for if this process means anything, it must imply that the individuals involved possess incomplete information about the production possibilities and preferences of others. If information were perfect, there would be no purpose in conducting ‘negotiations’. All the potential ‘core allocations’ or ‘solutions’ could be computed mathe­matically, as indeed we computed some in Tables 1.4 and 1.5. The big problem would then be that of choosing between a number of possible known solutions rather than discovering some particular solution or other. Choice between multiple solutions raises extremely difficult issues, since a move from one possibility to another involves some people becoming better off and others worse off (compare lines 1 and 2 of Table 1.5). Which of the many possible options available might eventually be agreed upon is there­fore not easy to determine, and it is at least conceivable that no agreement would be forthcoming. Faced with this problem, economic theorists have developed an ingenious escape route. It is possible to show that as the number of contractors in a market increases, then under certain conditions the set of ‘core’ allocations diminishes in size. Indeed, in the limit, with an infinite number of contractors the ‘core’ shrinks to a single allocation.7 No longer is there a problem of choosing between multiple solutions since only a single determinate solution exists.

For a theorist working with the full-information assumption and anxious to show the existence of a unique solution to the allocation problem, a shrinking core is no doubt a matter of some satisfaction. It is difficult to suppress the feeling, however, that where search is a costly activity, the smaller the core the more tiresome and protracted is the process of finding it. In a world in which information is discovered through the process of negotiation there would not appear to be the same compelling reasons to expect any particular outcome to occur. Indeed, it is not even clear that the final agreement will represent a core allocation. After some provisional contracts have been made the parties search around for a better deal. Nothing is finalised until each contractor finds that he or she cannot improve on his or her allocation. This, however, raises the question of how long people are prepared to search for coalitions which improve on their present position. As the number of contractors increases, so the number of potential coalitions increases exponentially and the number of core alloca­tions declines. Any commitment to try all conceivable possibilities could be likely to imply never coming to a final agreed solution.

If the process of forming contracts with one another involves the use of scarce resources, then the ‘best’ use of these scarce resources cannot be said to reside entirely in the discovery of a ‘core’ allocation. A more crucial question concerns how scarce resources are used in the process of con­tracting itself. Conventional expositions of the recontracting process and the discovery of an allocation of resources which is in the core of the exchange game are therefore suspect. Either the process described is itself a user of scarce resources, in which case it cannot be inferred that search will continue indefinitely until a solution is found, or the process does not use scarce resources, in which case it is merely an unnecessary story to cloak the ‘full-information’ assumption.

5. TATONNEMENT

The bargaining framework outlined in sections 3 and 4 deriving from the work of Edgeworth is not the usual approach adopted in elementary treat­ments of economics. It is more conventional to concentrate on the role of markets and the price system as a device for coordinating activity. Suppose, for example, that all contractors were able to exchange x for y at a ratio of one for one. For every person, the market price of x is one y and vice versa. Returning to Figure 1.1, it is seen that person A must sacrifice only 0.25 units of x in production to obtain a unit of y whereas in the market the price of y is 1x. With the marginal cost to person A of y production so much less than the prevailing price, it will be in his or her interest to spe­cialise in y production and exchange in the market. By this means, the person can achieve a production level of ten units of y and a consumption level of five units of each commodity (see Figure 1.3).

The marginal cost of y production, however, is less than the assumed pre­vailing market price for both persons B and C as well. Only person D will find it advantageous to specialise in x production, since for him or her the marginal cost of y exceeds the market price (the marginal cost of x on the other hand is less than its market price). Faced, therefore, with this ratio of exchange, the four individuals will be induced to specialise according to their area of comparative advantage and between them they will produce at point C in Figure 1.1.

Given the special nature of consumers’ preferences, however, it is clear that with production of 28 units of y and only three units of x there will be enormous excess demand for x. Equilibrium in the market requires quan­tities demanded and supplied at the prevailing price to be the same. Clearly a higher price of x relative to y is required to induce persons B and C to change their area of specialisation. Point A will be achieved if the price ratio is set between 3y for 1x and 4y for 1x. A single ratio of exchange applying to all transactors will result in a market equilibrium at point A.

This market ‘solution’ to the resource allocation problem turns out to be closely related to the concept of the ‘core’ mentioned in the last section. Suppose, for example, that the ratio of exchange were 3y for 1x. These market opportunities clearly do not affect person B, since they are exactly the same as the opportunities which confront him or her in production. He/she will continue to consume three units of each commodity. Person A, on the other hand, will specialise in y production (ten units) and exchange 7.5 units of y for 2.5 units of x, thus achieving 2.5 units of each. Both persons C and D will specialise in x (three units each) and exchange 0.75 units for 2.25 units of y, thus achieving 2.25 units of each. Comparing these results with Table 1.5, the reader can verify that they correspond with the entries on the first line. A market ratio of exchange of 3y per 1x will produce an allocation of resources in the conditions specified, equivalent to the first ‘core’ allocation of Table 1.5. As an exercise, the reader should verify that a market rate of exchange of 4y per 1x will produce a result equivalent to the ‘core’ allocation recorded on the second line of Table 1.5. Indeed, it can be rigorously proved that any competitive equilibrium will imply an allocation which is in the ‘core’.8 A given ratio of exchange apply­ing to all contractors of 3.5y per 1x will produce the third allocation of Table 1.5.

This theory of competitive equilibrium, however, suffers from similar difficulties to the exchange theory of Edgeworth discussed above. In its most general form, the theory indicates that there will exist, under specified conditions, a set of relative prices such that individual responses to these given prices will be compatible with equilibrium in every market. The ques­tion which the theory does not attempt to answer is precisely how this equi­librium set of prices is to be discovered. As Shackle (1972) puts it ‘what (general equilibrium) theory neglects is the epistemic problem, the problem of how the necessary knowledge on which reason can base itself is to be gained’ (p.447). Like the recontracting process of Edgeworth, the theory of competitive market equilibrium has an equivalent story to tell. In this case it is supposed that an ‘auctioneer’ sets prices and that people form provi­sional agreements at these given prices. If it transpires that excess demands or supplies exist, the provisional agreements lapse and the auctioneer modifies prices in an attempt to eliminate any disequilibria. This process is termed the ‘tatônnement’ process and is associated primarily with the name of Leon Walras.9

The major problem with the Walrasian auction is not simply that it does not represent an accurate representation of reality. Resource allocation is not conducted by means of Walrasian auctions and, more to the point, the reason why is not difficult to understand. Such a process would be enor­mously costly. Indeed, such is the complexity characteristic of exchange relationships that an attempt to proceed along Walrasian lines would absorb all the energies and resources of contractors without perhaps ever achieving a ‘solution’. Once more, the paradox of equilibrium theory is exposed. If all information is costlessly available, the auctioneer will get it right first time. If the process of acquiring information is costly, endless pursuit of a general equilibrium is the ultimate example of the ideal becom­ing the enemy of the good.

6. THE EQUILIBRIUM METHOD

The purpose of our brief discussion of the salient features of general equi­librium theorising conducted above is not to develop a detailed critique or to question the intellectual achievement which it represents. It is important, however, to appreciate the nature of that achievement and the implications which it holds for the theory of the firm and of economic organisation more generally. General equilibrium theory represents an existence proof. Under tightly specified conditions in a world consisting of many individu­als all with different tastes, skills and other endowments of resources, there will exist a set of relative prices of goods and factors compatible with uni­versal market clearing. Equivalently there will exist a set of agreements between the individuals which no one will wish to change. The activities of all contractors will be perfectly reconciled. For any given set of preferences, resources and technological possibilities a ‘solution’ to the resource alloca­tion problem exists in terms of specific outcomes.

Such a perfect coordination of all activity requires that agreements are concluded simultaneously and that transactions costs are zero. Knowledge of all technical possibilities both now and in the future must be assumed to be complete. The very passage of time itself can be admitted only in a very artificial sense. By extending the concept of consumers’ preferences to embrace consumption in future time periods, and of production possibili­ties to include the ‘transformation’ via investment of goods today into goods tomorrow, it is possible to envisage a set of equilibrium intertempo­ral prices. At some price ratio, the right to consume apples in period 2 may be exchanged for the right to consume nuts in period 5. The final set of agreements will then embrace transactions extending over all future time periods. Time exists as a dimension on a graph, but outcomes over time are completely predetermined at the moment of general agreement. Time is incorporated into the analysis but only at the price of robbing the concept of all meaning. Formally, ‘apples today’ and ‘apples tomorrow’ are simply two different commodities. Decisions concerning consumption and pro­duction levels are made ‘now’.

Time implies uncertainty, and the uncertain future poses intractable problems for any theory of rational choice. For general equilibrium theo­rists, a further extension of the Walrasian system to embrace transactions in ‘state-contingent claims’ is a possibility. Each transactor is assumed to possess a list of all possible future ‘states of the world’ along with some probability estimates attached to each state. Given initial resource endow­ments, the transactors exchange claims to resources contingent upon specified events. For example, a claim to one kilogram of cocoa in period 3 contingent upon heavy rainfall in Ghana, might exchange in equilibrium for two claims to one kilogram of coffee in period 4 contingent upon no frost in Brazil.

Quite apart from the transactions costs problem mentioned earlier, this effort to achieve a determinate equilibrium in the face of uncertainty encounters even more fundamental difficulties. For the transactors are ‘unboundedly rational’. All possible future states of the world are imagin­able and nothing can occur which has not been imagined. Yet, when the future is concerned, there would appear to be no limits on the agenda of possible events, no boundaries on the contingencies which might be con­sidered. Decision-making in the face of such uncertainty cannot then be rational in the sense of making one best choice in the face of known oppor­tunities. To quote Shackle (1972): ‘it is plain that in order to achieve a theory of value applicable to the real human situation, reason must com­promise with time’ (p. 269).

7. INSTITUTIONS AND INFORMATION

For the purposes of the theory of the firm, the important point about the general equilibrium method is that by effectively excluding time and uncer­tainty from the analysis, all transactions are costlessly and instantaneously reconciled. In this environment there are no institutional structures called firms. The efforts of all individuals are coordinated by a gigantic and complex web of contractual commitments simultaneously entered into. The economy is made up of a myriad of individual contractors, each one in an intricate and complex pattern of interrelationships with every other. As a description of economic life, however, this is clearly not very accurate. Institutions such as firms, clubs, political parties, trade unions and bureau­cracies exist, and their existence, if it is not to be left unexplained or put down to chance, can be viewed as the outcome of the attempts by rational individuals to solve the resource allocation problems which confront them.

If firms help in the process of resource allocation they must represent a response to factors from which general equilibrium theory abstracts. The economy, to use the analogy of Simon (1969) and Loasby (1976), is not like a watch made up of thousands of parts placed separately in an appropri­ate position relative to all the others, but is more equivalent to a mechanism made up of several subassemblies, the operating principles of which may be analysed separately even if their ultimate purposes may be fully under­stood only in the context of the complete item. A system of subassemblies places limits on the number of linkages which must simultaneously be con­sidered and thereby reduces the costs of establishing them.

Firms are formed and survive as an institutional response to transactions costs. In a world of costless knowledge they have no rationale, but in a world in which opportunities are continually being discovered and in which the formation of agreements between individuals is a costly activity, firms may be seen as devices for reducing the costs of achieving coordinated effort. The ways in which transactions costs are reduced and the problems which arise as a result will be discussed in greater detail in future chapters. For present purposes it is sufficient to remember that ‘firms’ are charac­terised by a system of bilateral contracts. Each person comes to an agree­ment with ‘the firm’. In the case of a small business a single proprietor might be the central contractual agent. In more complex cases the agree­ment will be between employees, managers, bondholders or landowners and a ‘legal fiction’ such as BP or US Steel. The firm is a ‘nexus of con­tracts’.10

The nature of this set of contracts is of very great importance. They are not highly specific contracts. They will not normally lay down extremely detailed provisions concerning when, where and how particular tasks are to be performed. When we join a firm as an employee we agree, within certain limits, to do whatever we are asked to do. We agree to be ‘organ­ised’. When we join as a manager we agree to organise resources, and have considerable discretion as to the way this may be done. Contracts, in other words, are imperfectly specified. This lack of specificity derives from the simple fact that the precise details of the actions required of the employees of a firm may be unknown at the time the contract is made. The decision­making process continues through time, and only time will reveal the deci­sions which may be made in the future concerning the best plan of action for the firm. If contracts had to be renegotiated with every small change of policy, the firm as a useful device for allocating resources would disappear.

Within the firm, information is collected concerning opportunities for productive collaboration, on the skills and attributes of employees, on new technical innovations, on the demands of consumers and so forth. This information must be transmitted to the relevant decision-makers who must then choose and implement a plan of action. Resource allocation within the firm is not therefore the outcome of entirely decentralised decisions by individual people in response to their particular circumstances as in a market process. Nor is it the result of simultaneous agreement between all contractors as in a state of general equilibrium. Resources within firms are allocated by the conscious decisions of planners. The market process is replaced in the firm by a planning process. Firms are ‘islands of conscious power in an ocean of unconscious co-operation’ to use D.H. Robertson’s vivid metaphor.11

It is important for readers to recognise that this initial characterisation of the firm will be amended in important respects in future chapters. As presented here, our definition depends upon a clear distinction being pos­sible between a ‘market process’ and a ‘planning process’. Later, we shall question whether a clear dividing line can be drawn, and we will investigate in greater detail the spectrum of contractual relations which ranges from relatively arm’s length market types towards contracts involving more ‘firm-like’ characteristics.12 Certainly it should not be inferred from the above paragraph that firms must be monolithic organisations with highly centralised planning arrangements. As will be seen in Chapters 6 and 7, contractual relations can vary substantially both within and between firms.

The existence of firms suggests, however, that up to a point, at least, a structure of loosely specified and durable contracts with a central agent may have advantages over the market. Groups of people may find it expe­dient to accept these arrangements if they permit the more effective gener­ation and use of information. It can be advantageous to be told what to do if the decision-making processes used within the firm make it possible to coordinate activities more productively than would otherwise be possible.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Institutions and contract enforcement in the firm

1. The Exchange Game

The firm as a method of enabling contractors to adjust to change and of encouraging the discovery of new possibilities for mutually advantageous cooperation will be a major theme running through this book. Alongside it there will be developed a different, though complementary, perspective. In section 2 we supposed that a number of people found themselves stranded in unfamiliar circumstances and we argued that generating infor­mation about the possibilities technically achievable would be their most important problem. Of equal significance, however, is their lack of infor­mation about how far they can trust each other. The possible advantages flowing from cooperative effort through division of labour and exchange were clear enough (sections 3 to 5) but these gains to the group as a whole could only be achieved if people could be relied upon to abide by the terms of an agreement and not to cheat.

Exchange requires that each contractor accepts the right of the other to the resources at present in their possession. Rights to the resources avail­able to people stranded on a desert island would not be well established. No doubt the stranded travellers would apply some of the conventions about property which were familiar in their place of origin. But with no agency to enforce property rights, the danger of a Hobbesian war of all against all would be a real one. Looked at from this perspective, therefore, the asser­tion in section 2 that the four stranded individuals lacked information about technical possibilities is only one part of the story. They also lack conventions upon which they can rely in their dealings with one another. More generally, they require a trade-sustaining culture.

The difficulty in establishing such a culture can be seen by looking at each potential trade in the form of a game. Two people may see a potential advantage in exchanging x for y. Person C (as in the coalition illustrated in Figure 1.2) might therefore suggest that person D should provide 1.5 units of x and that, after receiving the x, C will return in exchange 1.5 units of y. Person D replies that this is an excellent idea but that it would be even better if C would first provide the 1.5 units of y and that, on receipt of the y, person D would immediately return 1.5 units of x. If neither party is able to trust the other, no trade takes place and the payoff is zero. If both are trustworthy and the trade is honestly made, they will each receive some net benefit from the exchange (let us say this has a value of unity to each person). Finally, if one party mistakenly trusts the other, sending the x or y but receiving nothing in return, the payoff to the cheater we may suppose is two while that to the trusting party is — 1. This structure of payoffs is recorded in Table 1.6.

Person C’s payoffs are given on the left of each pair and person D’s on the right. The game is symmetrical in terms of outcomes. The precise numbers attached to the outcomes are not crucial to the discussion, however. It is the structure of the game as a whole which matters. As pre­sented in Table 1.6, the game of exchange is an example of a prisoner’s dilemma. Consider the situation from the point of view of person C. If D cooperates and provides C with the agreed units of x, person C will receive a higher payoff if he cheats. On the other hand, if D cheats and does not send the agreed units of x, it is still true that person C will do better by also being uncooperative. The worst outcome is to be a ‘sucker’. Cheating is therefore a dominant strategy in a prisoner’s dilemma. This is so even though, from a social point of view, the combined payoff to persons C and D from their both being cooperative exceeds that available from the other alternatives.

Many situations in economic life which require cooperative effort can be modelled in the form of a prisoner’s dilemma. In Chapter 2 the problem of providing public goods is briefly discussed, while in Chapter 4 the difficulty of providing incentives to cooperate in a team activity provides the basis for much of the analysis of Part 2. Here it is necessary merely to note the possible responses to the prisoner’s dilemma which have been suggested and which can be seen reflected in the institutional arrangements which have developed. If cooperation is so difficult, how is it that we observe as much cooperative activity as we do? Four possible answers are proposed below, which will receive varying amounts of attention in future chapters. These may be summarised as the development of reputation, the evolution of norms, the use of outside monitoring and penalties, and finally the cre­ation of a cooperative ethos through leadership.

2. Conventions and Norms

The most obvious objection to the simple exposition of the prisoner’s dilemma in Table 1.6 is that it presents trade as a single, never to be repeated encounter between two people who have no knowledge of each other. If, instead, we consider the possibility that persons C and D might repeat their exchange transaction many times in the future, the game changes its nature. Repeated games or ‘supergames’ can be complex to deal with because the number of possible individual strategies rises exponentially with the number of iterations of the game. It is no longer simply a question of whether to cooperate or cheat in the first round, but whether to cooperate or cheat in the second, conditional upon what had happened in the first and so forth. Nevertheless, both intuition and formal analysis confirm that a sufficiently high probability of repeated dealing will provide a framework in which cooperation can take root.13 The strategy which has attracted the most theoretical attention and which has performed well in simulations of repeated prisoner’s dilemma games is ‘tit for tat’.14

A person playing ‘tit for tat’ cooperates in the first encounter and there­after cooperates or cheats according to the behaviour of his or her oppo­nent. The obvious advantage of this strategy is that two people playing ‘tit for tat’ will cooperate with one another in the first and all subsequent rounds while limiting their vulnerability to cheaters. Suppose, for example, that after each play of the game there is a probability of 3/4 that another round will be played (that is, there is a one in four chance that it will be the last). The players do not know how many times they will actually play the game but the expected number of iterations will be four. Even this may be enough to make ‘tit for tat’ worth considering as a strategy. The expected payoffs to the players are recorded in Table 1.7.

Clearly, if persons C and D both play ‘tit for tat’ they receive an expected payoff of four each. They both cooperate in round one and then, because each has cooperated, they continue to do so in future rounds. If both cheat in every round their payoff is zero as before. Where one person cheats and the other plays ‘tit for tat’, the latter is exploited in the first round only – the cheater receiving two and the cooperator -1. In subsequent rounds, both cheat and receive zero payoffs.

A sufficiently high probability of repetition thus changes the structure of the exchange game. When confronted with a certain cheater it remains the best strategy to cheat. But cheating is no longer the dominant strategy. If someone is playing ‘tit for tat’ it is better to respond in kind than to cheat. Assuming that it is impossible to know in advance which strategy particu­lar individuals are going to play, strategy choice will depend upon what probability people attach to meeting a person playing ‘tit for tat’. Let the probability of meeting an opponent playing ‘tit for tat’ be p* and the prob­ability of meeting a cheater be (1 -p*). Our expected return to ‘tit for tat’ will then be

4p* + (-1) (1-p*).

Similarly our expected return to cheating will be

2p* + (0) (1-p*).

Thus, ‘tit for tat’ will give us a higher expected payoff providing that

5p* -1 > 2p*

that is

p* > 1/3.

If the probability of meeting a person playing ‘tit for tat’ exceeds 1/3, it will pay us to adopt that strategy also. Thus, once this critical proportion of ‘tit for tat’ strategists in a population is exceeded, there will be a tendency for it to grow. People will learn that cooperation is in their own interests. It is also evident that the higher the probability of repeat dealing, and hence the higher the expected number of iterations of the game, the lower this criti­cal probabilityp* will be. With the payoffs of Table 1.6, if the chance of any given play of the game being the final one is only 1/100 (and hence the expected number of plays is 100) ‘tit for tat’ would give a greater expected return than continual cheating, even if the chance of finding another ‘tit for tat’ player was as low as 1/99. Such are the enormous potential rewards from finding someone cooperative, that quite large losses in first-round plays with cheaters are worth incurring in the search.

It is therefore possible to tell a plausible story about the evolution of cooperative behaviour.15 Mathematical biologists use the concept of the ‘evolutionary stable strategy’ to describe a strategy which is immune to inva­sion by a group of mutants playing any other strategy (see John Maynard Smith, 1982, Evolution and the Theory of Games). In certain conditions, self­interested behaviour may result in the widespread adoption of the ‘tit for tat’ strategy in trading games. ‘Tit for tat’ becomes what Sugden (1986) calls a convention. Once established, there are powerful forces of self-interest tending to maintain it, even though alternative conventions might equally well have developed. People comply with established norms of behaviour not necessarily because they think these are worthy of respect from an ethical perspective but because they accord with their own selfish interests.

It can be argued, however, that once conventions have become estab­lished they gradually accumulate about them an aura of moral acceptabil­ity. People may begin to follow norms of behaviour not merely because it is in their interests to do so but because they believe these norms have moral force.16 They feel they ought to cooperate in the exchange game and would feel a sense of guilt if they did not do so, even in situations where a single round of the game is all that is expected. Tourists, for example, may be in no greater danger of cheating in a small local hotel than in an international chain that hopes to attract their custom again in a different location. In general, however, the existence of the ‘traveller’s tax’ is not widely doubted, although whether it derives from poor knowledge of local conventions on the part of travellers, or poor adherence to more universal conventions on the part of local inhabitants, is perhaps a moot point.

3. Reputation

According to the argument of subsection 8.2, cooperative behaviour devel­oped because successive rounds of the exchange game are played with the same person. The non-repeated prisoner’s dilemma is a game in which the players know nothing of one another except that, in the absence of a moral imperative, the players are virtually impelled to cheat. Repetition enabled knowledge of an opponent to build up, and cheating could be punished by lack of cooperation in the future. Each person was assumed to develop knowledge about other contractors in the market entirely by personal expe­rience. Those playing a ‘tit for tat’ strategy would remember those who had cheated and those who had cooperated on their last acquaintance. This memory would determine the strategy played in any future encounter with these people. Strategy choice becomes, to this extent, personalised.

Clearly the forces leading to cooperation would be greatly reinforced if information about strategy choice in an encounter with one person were available to other potential transactors thereafter. A person observed cheating in the first round would then know that all future encounters would yield nothing, even if these encounters were with ‘new’ opponents. The new opponents would know that the person played the cheating strat­egy in round one, and would respond with the like strategy in future rounds. People who might otherwise have played ‘tit for tat’ on their first encounter with the cheater will be warned off and will defect. Only a period of coop­eration in the face of defection by others might re-establish a person’s ‘standing’ after the initial decision to cheat.

4. Monitoring and Penalties

To achieve a cooperative outcome given the payoffs in the trading game recorded in Table 1.6, something has to happen to change the structure of the game. In subsection 8.2 the possibility of repetition was enough to produce this effect. An obvious alternative is for some third party to monitor compliance with the agreed deal and to punish a transactor who cheats. If this monitor can, for example impose a ‘fine’ in excess of one on anyone who cheats, the payoffs in Table 1.6 will be such that cooperation is the dominant strategy even in a single encounter.

The monitoring solution is particularly likely where information about behaviour is otherwise poor. In the case of our exchange game it is at least clear to each transactor what strategy their opponent has played and the ‘discipline of continuous dealings’ may then be sufficient to ensure cooper­ative behaviour. In many of the contexts with which we shall be concerned later in this book, however, it may not be possible to tell whether someone has cooperated simply by looking at the outcome of an agreement. Especially when many people are trying to cooperate on some joint enter­prise, apportioning responsibility for the final outcome may not be feasible. Transactors again face a prisoner’s dilemma, but the ‘tit for tat’ repeated game solution will not work. Not only may it be impossible to determine who has cooperated and who has not, but the rational response to this information is not obvious. Do I withdraw my cooperation in the next round when a single other person in the group cheats? Or do I cooperate providing a sufficient number of others do likewise? If the latter, how big does the cheating group have to be before I join them?

Where individual behaviour can be fairly accurately gauged by other transactors in a group, and where punishment can be focused on the non­cooperative person, the forces of spontaneous order may operate to a degree. In Chapter 10, for example, the use of peer pressure and ‘shame’ to induce cooperative behaviour in a profit-sharing enterprise is discussed. The need for a specialist monitor and enforcer plays a major role in much modern analysis of economic organisation, however. The firm as a device for policing and enforcing contracts will therefore be a continually recur­ring theme throughout the book.

5. Moral Leadership

Imposing a sufficiently large fine for cheating may, in principle, turn the exchange game from a prisoner’s dilemma into a game of harmonious coordination. It is usual to see the punishment or fine as administered by some monitor as discussed in subsection 8.4. Casson (1991) argues, however, that, especially where the costs of monitoring are high, it is the task of leadership through ‘moral manipulation’ to associate cheating or slacking with a guilt penalty. Thus, the penalty is psychological rather than material. It may be powerful because it operates even in circumstances where a person’s cheating is not discovered by other people. Obviously, if people feel bad about cheating, more possibilities for beneficial exchange will be realised. A trusting culture sustains trade.

We have already seen in section 8.2 that rules having moral force might evolve over time. People might abide by such rules even when flouting them is in their purely material interests. Casson admits that trust can emerge nat­urally but ‘in many cases it needs to be engineered’ (p. 28). Leaders are in the business of moral propaganda and preference manipulation. Expenditures on guilt-enhancing propaganda are a substitute for monitoring expenditure. Some further discussion of the role of guilt and shame in economic organ­isation will take place in Chapters 10 and 11, but the role of leadership in setting the general moral climate will not be pursued further.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

The process of exchange and the fim

All exchange transactions encounter problems of information and enforce­ment. Consider, for example, the process of building houses. Suppose that a person, A, who, for the sake of convenience, we shall call a ‘he’, wishes to build an extension to his home. One possibility is that he will draw up some plans, submit them to the relevant public authorities, dig the foundations, order the bricks, mix the cement, build the walls, plaster the interior, put in the doors and windows and undertake to install any electrical fitments and plumbing. Elementary economic principles, however, suggest this is unlikely. Recognising the advantages to be gained from specialisation and exchange, person A might instead decide to spend his time in a suitably remunerative occupation and then to purchase the services of specialist help. He could, for example, pay an architect to draw plans, another agent to obtain the necessary planning consent, a bricklayer to build walls and an electrician, carpenter, plumber and so forth to fulfil their respective tasks.

By forming agreements with specialists, person A will gain the classic advantages from exchange. But he has also given himself some problems.

  1. Like a cook consulting Hannah Glasse’s Art of Cookery who is there advised ‘first catch your hare’, A has to find the people who are going to help him. As may be subtly suggested in this celebrated misquote, obtaining the constituent ingredients is not necessarily the easiest part in any process of coordination.
  2. Having located his bricklayer, architect and electrician, A has to form some assessment of their professional competence.  If the bricklayer is more productive  at laying bricks than is person A there should be some advantage in using his or her services, but how does A acquire such information? The existence of some other examples of the brick- layer’s handiwork which A can inspect,  or the recommendation of other satisfied clients, are obvious possibilities. Inspecting  other ser- vices and goods, however, may present greater  difficulties. Person A may never be quite sure that he is not risking life and limb each time he switches on his electric kettle!
  1. With each of his contacts, A will draw up a separate  agreement,  but this will not necessarily be as straightforward as it sounds. Person A knows what he wants to do in a very general sense: he wants to extend his house. The technical details of how this can be accomplished and the options available may, however, be quite beyond him. When he approaches his architect with a request to produce some plans, he therefore confronts a significant problem. He cannot ask the architect to undertake a highly specific and carefully delineated  task, since at this level of  detail  A quite  literally  does not  know  what  he wants. Instead he must ask the architect to act on his behalf. The architect is A’s agent and is asked to make specific recommendations which are likely to satisfy the general requirements  laid down by person A. Proceeding in this way enables A to gain the advantages of specialised advice, but  as an intelligent  and  shrewd individual  he is sure to be beset by a few nagging doubts.

If, for example, A does not like the architect’s suggested plans and does not wish to proceed with the project, will he have to pay a fee to the architect? Clearly the architect is unlikely to agree to waive his or her fee simply because the client is dissatisfied. Such an arrangement would provide an enormous  incentive to person A to dissemble. He would  claim to  see no  merit  in the  plans  whatever  whilst  secretly taking careful note of their contents. The alternative arrangement, however, by which the architect is paid a fee irrespective of the quality of his or her work, is likewise fraught with difficulties, this time from the perspective of person A. Person A may wonder whether the archi- tect has given his problem  more  than  a moment’s  thought, or has perhaps  delegated the case to some assistant  of little talent and even less experience.

Similar considerations will play a part in person A’s dealings with each of the other tradespeople involved in his project. The plumber, for example, cannot  be told in detail how to proceed,  since only the broad  objectives are defined by A. Technicalities  such as the gauge and type of piping to be used, the potential  heat output required  of the boiler, the positioning of thermostats, the siting of the pumps, are all matters upon which A will have to accept the advice of the expert.

The plumber  will be asked to solve these detailed problems  in ways which serve the interests of A. He or she should not install a boiler with the wrong characteristics  simply because he/she stands  to gain from an agreement with the suppliers, but the client will be in a weak position from which to detect such behaviour.

  1. Overcoming the  difficulties of  formulating enforceable  agreements with each individual is an important prerequisite to the success of A’s plans. Of equal significance, however, is A’s ability to coordinate the activities of each of his helpers. To build an extension to a house using specialist help involves many people cooperating together. Only in the simplest cases will the provisions of one person’s contract  be entirely independent  of the provisions  in another’s.  A decision, for example, to lay a concrete floor rather than a wooden suspended floor will influence the way in which the heating  system is installed. Likewise, the  electrician  and  plumber  may  have  to  work  closely together  at various  stages. Thus, A will find it difficult to finalise his agreement with any one person in the absence of agreements with all the others. Stolidly, he contacts first one and then another, asking advice, modi- fying his original proposals  and renegotiating terms until eventually he calculates  that  construction can  begin.  Inevitably  there  will be some  residual  uncertainty about  his plans,  some  unforeseen difficulties  which  will arise  and  which  will result  in  a  continuing process of bargaining. Within rather vaguely defined limits, his crafts- men accept the obligation to be flexible. Outside these limits, they will claim that the job they are doing was not part of their original agree- ment and will therefore wish to renegotiate  terms.

As building starts, A becomes painfully aware that delays and problems in one area have implications for his plans in others. Bricklayers turn up but cannot build because the inspector has yet to see the foundations. Person A nevertheless pays them for their time. The nagging doubts  which afflicted A at the beginning now turn  to serious concern. Indeed, he begins to have nightmares.  In his sleep he sees the extension to his house. Were those gaps in the roof really the latest thing in ventilation?  Is it usual for walls to sway so far in the breeze? In the nearby hotel, his architect and lawyer share a joke over a glass of whisky. It seems they are using his wallet to pay for their drinks. His gaze returns to his extension, only to see the whole struc- ture collapse in a cloud of dust. Across the rubble a shadowy figure advances towards  him coughing and dusting his pin-stripe  suit. The planner  from the local authority serves him with a demolition  order as a result of failure to comply with all necessary regulations.  Person A wakes up sweating. He, at least, has discovered the primary message of this chapter. Whatever may be the potential  advantages of special- isation and exchange, they certainly do not come free.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Contracts and information in the firm

Our fictional story of person A’s building project was designed to highlight some of the difficulties everyone encounters  at some time or other in the process of contracting. It is now necessary to look at the issues involved from a more analytical viewpoint. Perhaps the most important point about the hapless A is that  his problems  all derive from various  forms of infor- mation deficiency. Were information costlessly available  and  all transac- tions costlessly enforceable  most of his worries would be over. Consider now the various points at which A confronts  the problem of his own igno- rance.

1.    Adverse Selection or ‘Hidden Information’

1.1    Examples of adverse selection

The first problem  was that  A did not know the location,  skills or reliabil- ity of the tradespeople he required.  Finding  out this type of information requires time spent in search. More consideration will be given to the ques- tion of search in Chapter 3, but it should  be evident from the discussion in Chapter 1 that  to search exhaustively  – that  is to search until informa- tion is complete – would be to do nothing  else. At some stage the costs of further  search  in terms  of  the perceived opportunities forgone  will out- weigh the benefits in terms of the expected new opportunities potentially discoverable.  Further, certain types of ignorance are in their nature  costly to dispel through search.  Ignorance of  the price of  a very well-defined product can be mitigated  by asking for quotes from an increasing number of sellers, but overcoming  ignorance  of product quality  is more difficult. It may be worth paying more for the services of a more skilled person, but how is person A to tell a good and reliable craftsperson from a poor  one ex ante? Clearly, this problem ultimately derives from the difficulty of pre- cisely specifying what  services are  required in a contract. If  a contract between person A and a craftsperson were clearly and unambiguously specified the ‘reliability’ or ‘skill’ of  that  person  would  not  be an issue. Either the provisions  of the contract are fulfilled and the craftsperson demonstrates sufficient skill or they are not fulfilled in which case the absence of sufficient skill results in a specified penalty.  A skilled and reli- able craftsperson is valuable to person A because he may not have the tech- nical  knowledge  required  to  specify precisely what  is to  be done.  The absence of this knowledge, however, will make it difficult for A to discover the credentials  of his potential workforce.  All craftspeople, skilled or oth- erwise, will have an incentive to overstate  their expertise in the process of negotiation, and  the  truly  skilled will have  difficulty  in communicating their status to the doubting person A.

The problem which we have uncovered here is in fact of very widespread interest in economics. Essentially it relates to any transaction in which one of the parties is better informed than the other. Such transactions are said to be characterised by a structure  of information which is asymmetric. Akerlof  (1970) gives as an example of a market with asymmetric informa- tion; transactions in second-hand cars. It will often  be very costly for a buyer of a second-hand car to determine accurately its true quality. He or she may or may not buy a ‘lemon’ (an American expression for a bad car) but ex ante there is not much that can be done to avoid it. Ignorance  on the part  of buyers will imply therefore  that  both  good and bad second-hand cars sell for the same price. Sellers of these cars, however, will have much better knowledge of their history and characteristics and therefore corre- spondingly  better  judgement  about  the probability of obtaining good or bad service in the future. The upshot will be that owners of good cars will tend to feel that the second-hand market  value seriously understates their (better informed) valuation. The owners of ‘lemons’, on the other hand, are more likely to sell. As the average quality of second-hand cars offered for sale falls, the price that they fetch falls with it, and this accentuates  the ten- dency for only the worst to be offered. It is at least possible to develop a model in which this adverse selection problem is so serious that no transac- tions will take place at all, even though better-informed buyers would stand to gain.2  Note the implied assumption, however, that contracts  cannot  be drawn up in such a way that failure of a car to meet the standards claimed for it would elicit penalties from the seller.

Perhaps the classic instance of adverse selection brought about by asym- metric information is in the realm of insurance.  A provider  of insurance against  some  undesired  contingency  (say  ill  health)  may  be  less  well informed  about  the probability of  the occurrence  of  this event than  the person seeking insurance. The terms quoted by an insurance company will be based upon certain basic pieces of information such as the age and medical history of the person buying the insurance; information which may be obtained  at relatively low cost. This information may not  be detailed enough, however, to distinguish  with sufficient subtlety between relatively good  and  bad  risks. Once more,  people  with good  health  prospects  will regard the insurance terms offered as rather unfavourable while people with bad prospects  will find the terms attractive. The people who are therefore most inclined to take out health insurance are those most likely to require health care. There is an ‘adverse selection’ problem based upon asymmetric information. Ex ante it may be very costly for an insurance company to dis- tinguish between good and bad risks, just as, by assumption, buyers of cars in the last paragraph could not distinguish good and bad cars, and person A in our story found it difficult to distinguish between good and bad crafts- people.

1.2    Adverse selection and ‘reputation’

In this context it is useful to note the importance of ‘reputation’ or ‘good- will’ in markets with asymmetric information. As was seen in the exchange game of Chapter  1, if knowledge  of cheating  becomes widely known  ex post, traders  will have an incentive to remain in good standing.  A seller of second-hand cars who has established a reputation for providing good cars will be able to charge higher prices or establish markets where none existed before. Similarly, person  A found  it expedient  to use craftspeople with a good local reputation. The existence of ‘goodwill’ economises on transac- tions costs by reducing search, thus enabling trade to take place in higher- quality products  and services than might otherwise be possible.

1.3    Adverse selection and ‘signalling’

Another response to the adverse selection problem is to think of ‘signalling’ or ‘screening’ mechanisms.  A signal is an activity which convinces buyers of the quality of a seller’s wares. It is convincing because it is so structured that a seller of poor-quality products  would be irrational to undertake the activity. As Hirshleifer and Riley (1979, p. 1406) express it: ‘Signalling takes place when sellers of truly higher-quality products  engage in some activity that   would  not  be  rational   for  those  selling  lower quality  products.’ Examples of possible signalling devices include the following:

  • Advertising    Advertising  may  be  more  rational   for  producers   of high-quality   goods  than  for  providers  of  low-quality  goods.  By engaging in an expensive advertising campaign, producers are expressing confidence in the ability of their product  to attract and keep customers.  A shoddy  product  that  may fool customers  on  a single occasion  but  for no longer is unlikely to be worth  extensive advertising.
  • Education qualifications    Educational credentials can act as a signal if the marginal cost of education  is lower for higher-quality workers. Offering higher wages for people with certain  qualifications will be most attractive  for those who find these qualifications the least costly to achieve. A more detailed exposition of this mechanism will be pre- sented in Chapter  6.
  • Insurance deductibles    In insurance markets, higher quality risks can signal their status to insurance companies by a willingness to accept a big deductible (that is, less than full coverage).

Signalling may develop in the presence of information asymmetry and adverse  selection.  It  should  not  be assumed  that  the results  are  always socially advantageous. As will be seen later, signalling can lead to a waste of resources. If the signal is costly to transmit  and the private gains to sig- nalling are all derived from someone else’s private losses, the end result will be  a  simple  redistribution  of  income  with  resources  dissipated  in  the process. Where, therefore,  the information conveyed by a signal permits a more effective allocation of resources, signalling may be both privately and socially beneficial. Some signals, though,  may have the opposite  effect. In Chapter 9, for example, we will discuss how ‘signalling’ to the stock market by managers of joint-stock companies might result in a form of ‘short termism’.

1.4    Ignorance of contractual results

Where information remains hidden ex post and the buyer can never really tell the quality of the services he or she has received, the adverse selection problem  is yet more difficult to overcome.  Suppose,  for example, that  A attempts  to draw up a set of state-contingent contracts  with his workpeo- ple. He agrees, for example, that if certain geological conditions  are found to prevail, the foundations will need to be strengthened or extra drainage installed. He agrees that if weather conditions  are unfavourable, construc- tion may be delayed and specified extra expenses incurred and so forth. This type of contract  clearly requires that  A and his workpeople  can agree on what ‘state of the world’ actually  pertains.  If, for example, it proved  very costly for A to verify the correct position, the workers would have an incen- tive to ‘observe’ any state of the world which they felt was most favourable to  their  own  interests.  Where  it is expedient  to  discover  problems,  the worker  will duly discover them,  and where it is inexpedient  they will be ignored. When workers inform person A that they have ‘hit problems’, they are saying ‘I have observed a state of the world which permits me to take the following actions under the terms of our contract  and which commitsyou to extra expenditure.’ In many cases, the problem  will be sufficiently obvious  to  both  parties,  but  in others  person A may  have  to  trust  the worker. Asymmetric information therefore turns out to be at the root of A’s difficulties once more.

In the circumstances of person A’s building project it is perhaps unlikely that this extreme form of information asymmetry  will pose intractable difficulties.  Nevertheless,  there  are  situations  in which  the  quality  of  a service is difficult to assess by the purchaser even after it has been delivered. As will be seen in Chapter 11, this possibility lies behind some recent think- ing concerning the rationale  of the non-profit enterprise.

2.   Moral Hazard or ‘Hidden Action’

A second important problem facing A was that even after striking a bargain he did not know whether the other parties were fulfilling their obligations. The electrician  seemed to have wired his house, but was it safe? Was the architect  actually exerting him or herself on A’s behalf ? Was the plumber using unnecessarily  expensive equipment?  Whereas  under  section 2.1 the problem was that ex ante a buyer might be ill-informed about the qualities of a potential  purchase  or the difficulties encountered by people working on their behalf, the problem  being considered now concerns the difficulty of observing or deducing the actions of the supplier.

Moral  hazard  exists when the probability of a given ‘state of the world’ occurring  is influenced  by one of the parties  to a contract but when the behaviour of  this  contractor cannot  be  observed.  Insurance contracts again  supply  the  classic case. Suppose  that  person  A,  in  return  for  a specified payment now (an insurance premium), promises to pay to person B (who we will assume to be female) another specified sum in the event of person  B being  robbed.  Our  discussion  so far  has  been limited  to  A’s problem of deciding whether B is telling the truth  when she claims to have been robbed or whether in reality the only person being ‘robbed’ is person A. Even when the prevailing ‘state of the world’ is easily verified, however, there remains  the possibility  that  the outcome  was materially  influenced by the activities of B. In other  words, the probability of being robbed  is not entirely independent of B’s behaviour. She is obviously more likely to be robbed if she spends long periods of time away from her house and habitually leaves the door open than  if she installs a system of locks and alarms and never leaves her property unattended. This suggests the possi- bility that  the insurance  contract could specify conditions which commit person  B to take  certain  precautions. Once more,  the problem  of asym- metric  information is confronted, however.  If the contract states  that  B must  always  lock her door  when leaving her house,  how  is A to  know whether this provision  was or was not complied with when B was robbed? Further, although it is easy enough  to think  of  a few basic precautions against robbery, it would be extremely costly to investigate the detailed cir- cumstances of person B in order to establish the actions required of her in every particular. Only  B can  have  the  kind  of  knowledge  concerning specific circumstances necessary to determine  all the options available  to discourage  thieves. Once the insurance  contract is agreed, however, B will clearly have a much smaller incentive to engage in thief-discouraging activ- ities than before.

This general problem of verifying ex post whether a person’s actions have been compatible  with the provisions of a contract  is called the problem of moral  hazard.  In the context  of  insurance  markets,  Arrow  (1962) sum- marises the issue as follows: ‘The general principle is the difficulty of dis- tinguishing  between a state of nature  and a decision by the insured. As a result, any insurance policy and in general any device for shifting risks can have  the  effect of  dulling  incentives’ (p. 145). As we have already  seen, however, these problems  are not confined to insurance  markets.  Any con- tracts drawn up in conditions  of asymmetric information may give rise to moral hazard.  As Demsetz (1969) puts it: ‘Moral hazard  is a relevant cost of producing insurance; it is not different from the cost that arises from the tendency  of  men  to  shirk  when  their  employer  is not  watching them’ (p. 167).

3. Bounded Rationality

The third broad  class of transactional problem  facing person A which we may identify was the simple magnitude  of the potential task of coordinat- ing the activities of his workpeople.  This problem  would exist even were information symmetrical; that is, available equally to A and the people he employs;  and  it is therefore  logically distinct  from  the  issues discussed earlier in this section. Not only were the provisions  of each person’s con- tract interdependent but they would vary with all sorts of possible contin- gencies which might arise as work proceeded. The capacity of person A to imagine all possible future contingencies and then process the information required to allow for these different contingencies in the contracts  of each person  he hires is obviously  limited.  Person  A faces, in other  words,  a problem which is now usually referred to as ‘bounded rationality’.

The idea of ‘bounded  rationality’  is especially associated with the work of H.A. Simon (1957, 1969, 1979) and O.E. Williamson (1975, 1985). Both writers  use  the  example  of  the  game  of  chess  to  illustrate  the  issues involved. Given the rules which govern the movement  of the pieces on a chessboard, we might in principle consider constructing a list of all possi- ble games. We might  start  by recording  all possible opening  moves and then, for each one, record all possible legal responses, and so on, until we have built  up an entire  ‘decision tree’. The problem,  of  course,  and  the factor which prevents chess becoming a totally trivial pastime,  is that  the decision tree would be of such size and complexity that it beggars the imag- ination.  Even  the  best chess players  must  make  their  decisions  in  the absence of  a complete  list of  future  contingencies  which might  possibly flow from them, and resort must be made to a limited set of considerations which experience has suggested are important.

If rationality is conceived as selecting the best possible course of action for achieving a specified objective, chess moves evidently do not qualify. Yet most  people  would  baulk  at  describing  chess  decisions  as  irrational. Indeed,  chess is widely regarded  as the board  game requiring  powers of reason in the highest degree. Chess problems  are susceptible to the appli- cation of reason, but the complexity of the game is such that decisions are effectively taken under conditions  of uncertainty. This is why Williamson (1975, p. 23) argues that ‘the distinction  between deterministic  complexity and  uncertainty is inessential . . . As long as either uncertainty or com- plexity  is present  in  requisite  degree,  the  bounded  rationality problem arises.’

4.  Asset Specificity and ‘Hold-up’

There is a final contractual problem  which plays an important role in the transactions costs theory  of economic organisation. Where situations  are complex and contracts  are not absolutely ‘cast iron’ and perfectly enforce- able, there will be room for adjustments to contractual terms over time in response  to  changes  in  the  bargaining  power  of  the  contractors.  This problem is particularly serious when assets become ‘transaction specific’.

Consider once more the relationship between person A and his architect. It is likely that, over time, the architect has accumulated all sorts of special information about  the circumstances  and preferences of A, the nature  of his property, and the engineering and other difficulties associated with con- struction  work in the area. Were A to change his architect, the new person would not have access to the same stock of information and might, there- fore, be much less effective whilst undergoing  the initial process of ‘learn- ing’ about the background. The existing architect has an advantage over an outsider  in serving person A – a so called ‘first-mover advantage’.  Notice that, by assumption, person A has financed the accumulation of this useful information by paying the architect a fee at least as great as could have been achieved elsewhere during the initial period of their association, but the information is in the form of ‘human capital’, and the human form is that of the architect,  not person A. The architect,  realising that  he or she is a much  more  productive   resource  than   any  outsider,   may  therefore   be tempted to raise his or her fee. Person A will not like this, but may be pre- pared to pay some increase because the alternative of employing a new architect carries an even higher cost.

The architect  is more productive  than  any outsider,  and  this return  in excess of what would be achievable using alternative  human  resources is a form of ‘rent’.3  There is a surplus associated  with the use of the existing architect, a surplus accruing to a type of specific capital (specialised knowl- edge), and there is a danger that much effort can be taken up in a fight over who should receive it. Although person A has financed it, the architect may attempt to take some of it by implicitly threatening termination. If person A succumbs to this threat,  he is the victim of ‘hold-up’.

It is not necessarily always the buyer in a transaction who is vulnerable to ‘hold-up’, however. Suppose, for example, that A has such eccentric ideas that the plumber has to invest in material and equipment which he or she is most unlikely ever to find a use for again and which has a very low value on the second-hand market. Once the plumber has invested in these assets, s/he in turn will be vulnerable to contract  renegotiation. They are transaction- specific physical assets, and any return above their value in alternative  uses is again a ‘rent’. Person A may try to push down the agreed price of the work thus appropriating some of the rent which accrues to these assets. The plumber  will feel aggrieved but may be in a poor  bargaining  position  for resisting some adjustment to contract  terms. Any return  on the assets in excess of their value elsewhere is, in principle, better than terminating the contract.

In  practice,  of  course,  these  problems   are  unlikely  to  be  of  great significance in a local community where reputational effects are significant. Further, a simple solution to the specific physical assets problem is for the plumber  to  insist that  person A should  finance  them  in the  first  place. Person  A will buy the specialised equipment  and the plumber  will use it when undertaking the work. The plumber cannot ‘hold up’ person A, since a threat to raise the agreed fee will be met with the use of another  plumber. Person A cannot ‘hold up’ the plumber because a threat to lower the fee will result in the plumber working elsewhere. In other words, there is no longer a ‘rent’ element associated with the plumber’s services. The rent accrues to person A’s services as a provider of capital to himself, and it does not matter whether the return is seen as accruing to the provider or the user. That par- ticular transaction has been ‘internalised’.

Although there  are circumstances,  therefore,  in which asset specificity poses no great problem,  vulnerability  to ‘hold-up’ in general plays a large part in the theory of economic organisation. The existence of transaction- specific assets (either human  or physical) will play a significant role in our discussion of the firm’s relationship with its employees (Chapter  6), its sup- pliers (Chapter  7) and its financiers (Chapter  9). It is also associated with the question of the viability of cooperative and other forms of enterprise (Chapter  10). As will be seen, the modern approach to the firm emphasises that  its capabilities  are highly specific and related to the information and experience accumulated in its membership.  This implies that  a significant part of any firm’s return is rent on its specific human assets. A high degree of trust is required if these rents are not to be dissipated in distributional squabbles.  Casson  (1991) argues  that  it is a crucial role of  the business leader to engineer this trusting  environment. We will concentrate more on the  power  of  repeat  dealing  and  reputation. There  can  be  no  doubt, however, that business structure both reflects the degree of trust that exists between transactors and influences its future development.

In the following section, consideration is given to a number of different institutional responses to the problems  we have been considering.  Search costs,  asymmetric  information leading  to  adverse  selection  and  moral hazard,  bounded  rationality, and vulnerability  to  ‘hold-up’  are  factors which  inevitably  influence  the  ways in which  people  contract  with  one another. If gains from trade are potentially  available, it might be expected that institutions will be developed to facilitate their realisation, and this will involve mitigating the effects of some of the forces which stand in the way.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Institutional responses to transactions costs of the firm

Information is costly to obtain. Finding out about the opportunities which are potentially available, about the quality of the goods and services on offer, and about the appropriate responses to various possible future con­tingencies, involves time and effort. The absence of information, as we have seen, can inhibit the process of exchange, but if the costs of acquiring the relevant information are too great the failure of exchange to occur is both predictable and efficient. This was the essential bone of contention in the celebrated exchange between Arrow (1962) and Demsetz (1969) already referred to in the section above. To observe the failure of exchange to take place is not to prove inefficiency if information costs exist. To assert other­wise is, according to Demsetz, to commit the fallacy of the ‘free lunch’ (that is, to assume that information is potentially available without cost). Alternatively, the fallacy involved might be described as the ‘people could be different’ fallacy (that is, that somehow or other people could be induced not to exploit asymmetries in information and that the problem of moral hazard might go away).

As we have taken some pains to elaborate, however, the process of exchange inevitably entails overcoming these transactional difficulties. To assume away the problem of information is not very helpful if the object of study is the theory of the firm, but it would be equally unhelpful to assume that no attempts are made to mitigate the problem. If exchange is poten­tially advantageous we would expect the would-be transactors to devise mechanisms which enable it to proceed. Institutions will develop which economise on information costs and permit trades which would otherwise be impossible to take place.

1. Money

The idea that institutions develop in response to transactions costs is famil­iar enough in certain areas. All students learn at an early stage the advan­tages associated with the use of money as a medium of exchange relative to a system of barter. Elementary textbooks will usually contain examples of the difficulties faced by a fisherman vainly searching for a cobbler who wants a piece of haddock, to use D.H. Robertson’s example. The problem of establishing a ‘double coincidence of wants’ is a slightly misleading way of looking at the issue of barter, however. A barter system – that is, a system involving the direct exchange of goods and services without the use of money – does not strictly require a ‘double coincidence of wants’ unless it is stipulated that all exchange transactions must be bilateral ones. A fisherman can obtain shoes from a cobbler under a system of barter even if the cobbler has no taste for fish, but to do so it will be necessary to involve other parties in a joint multilateral agreement. For example, a baker may agree to provide a certain quantity of bread to the cobbler. In exchange the fisherman provides the baker with fish and receives shoes from the cobbler. Finding the parties willing to take part in this ‘triangular trade’ and nego­tiating the terms of the contract, however, is clearly going to be more costly and require greater information than a simple bilateral deal. In principle, it is possible to envisage more and more individuals taking part in these mul­tilateral negotiations, but, as discussed in Chapter 1, obtaining a simulta­neous agreement between many contractors is likely to be extremely difficult.

Money enables contractors to escape from the requirement of forming agreements simultaneously. A complex pattern of exchange relationships can instead be entered into through a sequence of bilateral arrangements. To return to our simple example, the shoemaker might accept fish in exchange for his shoes even when he dislikes fish if he knows of a baker who will be happy to exchange bread for fish. In this case, the fish is being used in the form of a medium of exchange since its value to the shoemaker depends entirely on its ability to procure him something else. Clearly, if the agreements described here are not entered into simultaneously the cobbler will want to be confident of the willingness of the baker to accept fish, and it is apparent that this is not necessarily or usually going to be the case. A medium of exchange will be more acceptable to the shoemaker the more widely acceptable it is known to be to other people. Where confidence in the wide acceptability of a medium of exchange is strong, it will not be neces­sary for the shoemaker to have knowledge of the demands of any particu­lar baker. Any baker he knows will be happy to supply him with bread in exchange for money. This view of the origins of money is particularly asso­ciated with Carl Menger.4

As each economising individual becomes increasingly more aware of his eco­nomic interest, he is led by this interest, without any agreement, without legisla­tive compulsion and even without regard to the public interest, to give his commodities in exchange for other, more saleable, commodities, even if he does not need them for any immediate consumption purpose.

This passage from Menger effectively describes the evolution of what Sugden (1986) calls a convention. As we saw in Chapter 1, conventions can develop gradually over time in response to the self-interested choices of contractors. People accept money because everyone else does, and a par­ticular form of money may survive for long periods even if a different type, and hence a different convention, might have had better properties. Menger does hint, however, that ‘more saleable’ commodities are the ones that con­vention will tend to favour as money, and this raises the question of what is meant by that phrase. Alchian (1977a) provides a persuasive answer.

Imagine a world in which four commodities exist; Alchian supposes that these are called oil, wheat, diamonds and ‘C’. He notes that people cannot be expected to have expert knowledge of the characteristics of all the com­modities in which they trade. Transactions costs, he assumes, will be highest when two ‘novices’ trade together, and will be lowest when the two traders are ‘experts’ in both commodities traded. We have already noted in our earlier discussion of asymmetric information that traders in certain prod­ucts will have to establish a ‘reputation’ for trustworthiness if exchange is to take place. Let us suppose that by specialising in the trade of wheat in the sense that every exchange involves either its purchase or sale, a person becomes both an expert in assessing its quality and comes to command a high reputation. Such a reputation in a single commodity will be of little use, however, if the ‘expert’ wheat dealer is faced with the problem of assessing the qualities of oil or diamonds in the process of exchange. What is required if transactions costs are to be substantially reduced is a com­modity in which everyone is an ‘expert’. Such a commodity will be one the qualities of which are very easy to assay. This is the primary characteristic of ‘money’ and is implicit in Menger’s use of the term ‘more saleable com­modities’ as a description of money. Commodities are ‘more saleable’ if large numbers of people find their qualities can be assessed at very low cost. The use of money as an exchange medium reduces transactions costs because, in conjunction with the existence of specialist traders in other commodities, it increases the knowledge possessed by each contractor involved in an exchange. The specialist wheat trader will be an ‘expert’ in both wheat and money, while a customer (either a buyer or seller of wheat) will at least be an ‘expert’ in money. Of course, this in no way alters the fact that most of the specialist traders’ customers will be novices in wheat, and the implications of this have already been considered in the context of second-hand cars. In the absence of money, however, both parties to an exchange would usually be novices, while it is the use of money which permits the growth of specialised traders whose accumulated expertise and pursuit of goodwill are a response to the twin problems of asymmetric information and adverse selection.

Although a person accepting money in exchange for some good or service will feel confident that it can be used to procure other things, the precise terms of any future trades will not be known with certainty. Money is accepted in the expectation that it will permit the achievement of desired ends. It may even be the case that money is accepted with no specific and determinate ideas as to what is to be done with it. Instead, the person may prefer to wait upon events, and spend time searching for suitable opportu­nities. By reducing transactions costs, in other words, money permits wider search, and a more extensive and complex system of exchange transactions can occur than would otherwise be possible. Pigou (1949, p. 25) likened money to ‘a railway through the air, the loss of which would inflict on us the same sort of damage as we should suffer if the actual railways and roads, by which the different parts of the country are physically linked together were destroyed’. Money, that is, like the transport system, enables a wider range of transactions to take place.

But the existence of money implies more than a simple widening of pos­sibilities. It implies that these possibilities are discovered by a process which continues over time. People hold money speculatively in the hope that to commit themselves later will be advantageous compared with deciding on a course of action immediately. The cobbler, for example, might have bartered his shoes immediately for fish. In fact, he preferred money because he expected to be able to use it later in ways yielding him greater satisfac­tion. The fact that in many everyday cases the time period involved might be quite short, and the cobbler may have intentions concerning the use of his money which are usually and routinely realised (he is very sure of the terms on which he can buy bread at the bakers) in no way changes the general principle.

Ignorance inevitably restricts exchange. Institutions which help to over­come the problems posed by ignorance are therefore expected to take root. To quote Loasby (1976): ‘Money, like the firm, is a means of handling the consequences of the excessive cost or the sheer impossibility of abolishing ignorance … both imply a negation of the concept of general equilibrium in favour of the continuing management of emerging events’ (p. 165).

2. Political Institutions

The idea that a fundamental political institution such as ‘the state’ might be interpreted as a means of overcoming impediments to the process of exchange has a long history. Consider for example a celebrated passage from Hume:

Two neighbours may agree to drain a meadow, which they possess in common: because it is easy for them to know each other’s mind; and each must perceive, that the immediate consequence of his failing in his part, is the abandoning of the whole project. But it is very difficult, and indeed impossible, that a thousand persons should agree in any such action; it being difficult for them to concert so complicated a design, and still more difficult for them to execute it; while each seeks a pretext to free himself of the trouble and expense, and would lay the whole burden on others. Political society easily remedies both these inconve­niences . . .5

The focus of attention here is on the problem of ‘public goods’ (Samuelson, 1954; Musgrave, 1959). Some goods confer benefits not merely on a single consumer of the good but on a whole population of consumers simultane­ously. Standard examples include defence, public health provisions, the ser­vices of lighthouses and so forth. Pure public goods are said to be ‘non-rival’ in consumption and ‘non-excludable’ (that is, it is technically not possible, or at least enormously costly, to prevent any individual person from enjoying the benefits of a public good provided by others). The result of these two characteristics is that ordinary market processes ‘fail’ in the sense that a multilateral agreement which might potentially benefit all the parties to it will not emerge spontaneously. In Chapter 1, the costs of simul­taneous multilateral contracting have already been discussed in the context of private goods. There it was argued that although simultaneous agree­ment would be impossibly costly to achieve, alternatives existed which would be preferable to completely independent activity for the parties con­cerned. Resources would be allocated not in a single all-embracing moment of universal agreement but in a process involving bilateral exchanges and the use of money or the forming of institutions such as ‘firms’ to manage events as time advanced. Public goods, however, present us with a severe problem in that apparently the only alternative to a widespread multilateral agreement is no agreement. Resolution of this dilemma requires the exis­tence of the ‘productive state’ and the institution of collective processes in place of market processes. Buchanan (1975, p.97) re-emphasises Hume’s point if in somewhat different style:

Only through governmental-collective processes can individuals secure the net benefits of goods and services that are characterised by extreme jointness efficiencies and by extreme non-excludability, goods and services which would tend to be provided suboptimally or not at all in the absence of collective- governmental action.

Public goods present obstacles to the formation of agreements of a partic­ularly intractable nature, but they are not in principle different from those difficulties discussed in detail in section 2.2. There it was seen that the failure of transactors honestly to declare information could conceivably totally inhibit the development of certain insurance or other markets. Second-hand-car salesmen would always maintain that their wares were more reliable than they really were, and purchasers of insurance that the risks they faced were less than they really were. People, in other words, cannot be expected to declare honestly and voluntarily information which adversely influences the terms upon which they will trade when there are no cost-effective means of verifying the information. Trade in public goods is no exception. Further, where the simultaneous agreement of large numbers of individuals is involved, the problem of ‘bounded rationality’ cannot be overlooked. Thus, Hume’s two ‘inconveniences’ which, he argues, political society remedies, amount to the problems analysed earlier: ‘bounded ratio­nality’ (‘it being difficult for them to concert so complicated a design’) and an extreme form of information asymmetry leading to opportunistic behaviour (‘each seeks a pretext to free himself of the trouble and expense’).

3. The Firm as a Nexus of Contracts

A provisional rationalisation of the emergence of ‘firms’ was suggested at the end of Chapter 1. From the standpoint of economic theory, the firm represented a ‘nexus of contracts’ so framed as to provide flexibility in the face of unpredictable events. Uncertainty, and the resulting difficulty of precisely specifying the terms of each person’s contract, thus constituted the starting point for the theory of the firm. This approach has its origins in a celebrated paper by Coase (1937), and the discussion of transactional difficulties above (section 2), now permits a further appraisal.

For Coase: ‘The main reason why it is profitable to establish a firm would seem to be that there is a cost of using the price mechanism’ (p. 336). Coase is here referring to such matters as ‘negotiating and conclud­ing a separate contract for each exchange transaction’ (p.336). In the absence of a firm, each factor of production must contract with every other factor whose cooperation is required. Within the firm, each factor negotiates a single contract. In an extreme case where V individuals must all cooperate closely, a set of —1)/2 bilateral contracts would be required to bind the parties together. For five individuals, Figure 2.1 illus­trates that ten agreements would be necessary. In a slightly different context, Williamson (1975, p.46) refers to this as the ‘all-channel network’. In the firm, on the other hand, one person would become the central contractual agent and a total of four contracts would be sufficient to link all the parties together.

It is important always to remember, however, that the advantages of the firm in terms of savings in contracting costs presuppose conditions of uncertainty. With costless knowledge there would be no advantages accruing to a reduced number of transactional bonds, since these bonds would be costless to establish. In Coase’s view of things the firm economises on transactions costs because bargaining over what has to be done, and on what precise terms, does not take place. The firm is charac­terised by the conscious organisation or direction of resources over time: ‘When the direction of resources (within the limits of the contract) becomes dependent on the buyer in this way, that relationship which I term a “firm” may be obtained’ (p. 337). Within the firm, people do what they are told to do.

More recent contributions to this literature have modified Coase’s con­ception in important respects. Below we look at several ‘contractual modes’ and discuss what distinguishes ‘market-like’ from ‘firm-like’ arrangements.

3.1.The deterministic contract

Under this scheme the contractors agree to perform specific services at certain future points in time. The obvious problem is that, in the extreme, it completely lacks flexibility and presupposes that the contracting parties are able to determine exactly what they will require at all relevant times in the future. It will be recalled that person A found it difficult to formulate a deterministic contract with his craftspeople because of the problem of pre­dicting exactly what would be required of them at each stage.

This type of contracting is only likely to be observed, therefore, where the duration of an association is expected to be short, where conditions are stable and predictable so that routine procedures can be adopted, and where problems of information asymmetry are not serious.

3.2.The state-contingent contract

In a state-contingent contract, obligations are no longer deterministic and fixed. The requirements vary with the ‘state of the world’ which occurs. An example of such a contract was discussed in section 2. There it was argued that the specification of such a contract would be enormously time­consuming and complex. Ultimately, it would encounter the problem of ‘bounded rationality’; the sheer impossibility of imagining all the future contingencies which might arise along with the appropriate responses to them. We also noted that information about the ‘state of the world’ might be distributed asymmetrically between the contractors, thus leading to problems of ‘adverse selection’ and ‘moral hazard’.

3.3.Sequential spot contracting

Instead of a single contract, deterministic or probabilistic, involving com­mitments in future time periods, the parties involved in a project might con­tract period by period. As time gradually reveals what has to be done, the contracts are drawn up and the specified tasks are accomplished in a sequence. This solution faces the objection emphasised by Coase that the number of contracts required to accomplish a given objective will be very large if they are continually renegotiated over time. The number of con­tracts, however, will not be the only problem. An equally important issue is the cost of establishing each one. In principle we might imagine the process of recontracting as a relatively simple operation. The buyer of labour ser­vices, for example, would ask a worker to do something at the established wage rate and acceptance of this ‘request’ would imply that the ‘contract’ had been duly renegotiated. According to this view, associated with Alchian and Demsetz (1972), the type of contract observed within a ‘firm’ is not a single long-term contract of employment involving ‘direction’ of resources by the buyer, as claimed by Coase. Instead, there is an implicit process of continual renegotiation as in a system of sequential spot con­tracting.

3.4. The contract of employment

Where the process of recontracting operates as smoothly as suggested above there would be no advantage attached to a contract of employment. Indeed, it would arguably be extremely difficult in practice to distinguish between the two methods of contracting. Whether an ‘employee’ is consid­ered to ‘renegotiate’ his or her contract continually, or is seen as accepting contractually permissible ‘instructions’, may not in some circumstances be a matter of very great practical importance. Analytically, however, the dis­tinction is significant. The Coasian view of the employment contract requires that it is possible to specify a list of ‘acceptable’ tasks from which the employer can choose. Any attempt at such exhaustive listing must, however, confront the information problem. The employer simply will not have sufficient knowledge to draw up a contract of this nature. If, on the other hand, information is revealed gradually over time, and if, further, this information accrues not to everyone equally but to particular individuals (it is ‘impacted’, to use Williamson’s jargon), the crucial problem will be to set an environment in which people have an incentive to act cooperatively rather than opportunistically. Thus the employer is not a giver of instruc­tions, as in Coase’s model, but a provider of incentives. The employee becomes not a passive receiver and executor of orders but an active agent of the employer.

3.5. The agency contract

Under an agency contract, one party (the agent) agrees to act in the inter­ests of another party (the principal). The example of the architect and person A has already been discussed earlier in this chapter. Note that two important features are required to hold if the agency relation is to be inter­esting. Firstly, there must be a conflict of interest. The architect, by assump­tion, was interested in giving A’s plans the minimum amount of attention he or she could get away with. Person A, of course, was interested in elicit­ing from his architect the greatest attention that was possible. Secondly, there must be an asymmetry in the information available to principal and agent. Person A may simply not know what actions are possible and how they may affect him. He may not be in a position even to tell what action if any, his agent has taken.

Clearly if there were no conflict of interest, the existence of asymmetric information would not matter. The agent would always choose an action which accorded with the preferences of the principal. Similarly, if the infor­mation available to both principal and agent were the same, the conflict of interest would not matter since the principal would immediately detect any ‘opportunistic’ behaviour on the part of the agent. Where both asymmet­ric information and conflict of interest are present, the problem facing the principal will be to present the agent with a ‘system of remuneration’ some­times called a ‘fee structure’ or ‘incentive structure’, which will provide the principal with the greatest payoff.

In the case of the relationship between employer and employee, there are obvious parallels with the principal-agent problem. This was recognised by Coase, who, in a footnote in his 1937 paper (p.337), wrote:

Of course, it is not possible to draw a hard and fast line which determines whether there is a firm or not. There may be more or less direction. It is similar to the legal question of whether there is the relationship of master and servant or principal and agent.

The clear implication here is that only a contract of ‘master and servant’, which implies the direction of resources, will be found in the Coasian firm. The relationship of principal and agent would not be compatible with the existence of a ‘firm’. More recent theorists would not accept this judge­ment. Once the problem of asymmetrically distributed knowledge within the firm is recognised, together with the accompanying possibility of moral hazard, it becomes useful to view the employee as an ‘agent’, and the firm as a response to the agency problem. The Coasian insight that the firm replaces a whole system of multilateral contracts with bilateral contracts between employer and employee is maintained, but the nature of the con­tract between employee and firm is no longer seen exclusively in terms of the direction of resources. Whatever the strictly legal position, the econo­mist can argue that perceiving the relationship between ‘firm’ and employee in terms of principal and agent may provide valuable insights into the way resources are allocated within the firm.

3.6. Relational contracting

All of the contractual forms discussed above have one thing in common. The agreements are all well specified and complete. In the spot contract, the deterministic contract, the state-contingent contract, the agency contract or even the Coasian employment contract, the obligations of each of the parties under the contract are clear. An instruction under a contract of employment is either permissible or not. The quality of a product or service traded in a spot contract is clear and unambiguous. The agent’s fee will be related to results which will be easily observed by both parties while actions may not figure in the contract at all (in a sense, all lawful actions of the agent would then be contractually permissible).

In practice, of course, things are never that simple. As we emphasised in Chapter 1, the firm is a device for handling change and coping with the problem of lack of trust. The essence of the firm from this point of view is not simply that it involves fewer contractual linkages than would the market, or even that dealings are repeated period by period. The main point is that bounded rationality inevitably implies that contracts will be incom­plete, and incomplete contracts are vulnerable to opportunism. Williamson (1985) is particularly associated with this view.6 The contracts discussed above were all from the world of ‘classical contracting’ in which clear agree­ments could be formulated and, if necessary, enforced by an outside agency (the state). There are many circumstances, however, in which such enforce­ment mechanisms will be ineffective and the contractors themselves will have to develop a system of ‘governance’. The firm is then a system of gov­ernance for incompletely specified contracts. It establishes a framework in which the benefits from a continuing association can be achieved. Because potential conflicts will inevitably arise over time, procedures are devised to minimise their destructive consequences and induce as much cooperative behaviour as possible.

Contracting within the firm is not ‘classical’ but ‘relational’. Implicit exchanges are going on continuously, but they are not formalised in specific ‘contracts’. People are cooperative because they perceive it to be in their own long-term interests and because they come to trust the governance structures of the firm. Williamson emphasises that governance arrange­ments within the firm will not always be necessary. Where, for example, very specific services are required of someone at a particular point in time, and the quality of these services is easy to define and assay, there is no advan­tage to establishing a specialised ‘governance structure’. The spot market is the arrangement which minimises the cost of the transaction. Even where a continuing association is desirable, a firm may not evolve. Repeat dealing is a perfectly reasonable way of establishing trust and reputation in some market settings, as was seen in Chapter 1. The crucial element, for Williamson, is vulnerability to opportunism deriving from the existence of transaction-specific assets.

As explained in section 2.4, a provider of a service may gain knowledge and experience over time which is specific to a particular buyer. A work­force in a firm can also accumulate similarly specific skills. This is not, of course, undesirable in itself. The problem is simply the transactional one that continual renegotiation of contracts in these circumstances puts the employer in the sort of position faced by person A in our example of the building project. Each person with whom he or she is negotiating is in the possession of skills and information which he/she does not fully share. Further, once individuals or groups of individuals are more productive within the firm than they would be outside, the income generated by their efforts is a form of ‘rent’. Striking a bargain is then inhibited by asymmet­ric information and there is an incentive to act ‘opportunistically’ in pursuit of these ‘rents’.7

For Williamson (1985), therefore, special governance arrangements within the firm develop when specific assets require protection which clas­sical forms of contracting cannot provide. Frequency of repeat dealing is still important, both because it is repetition which generates transaction- specific knowledge and because infrequent dealing would not warrant the development of an expensive governance structure. Further, an ever- changing and uncertain environment favours the evolution of a firm because in very stable and unchanging circumstances, trust engendered by repeat dealing might be a cost-minimising response. Thus, frequent con­tracting with highly transaction-specific resources in an uncertain environ­ment leads to the evolution of ‘unified governance’.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Contrasting views of the entrepreneur

1. The Classical Tradition

To summarise in a few lines the views of a range of writers spanning more than a century from the time of Adam Smith onwards is clearly a haz­ardous undertaking and one which is liable to result in severe distortions to the work of some economists. As a generalisation, however, the classical tradition, at least in England, did not offer a very sophisticated account of the entrepreneur. Indeed this state of affairs has continued in the neoclas­sical analysis of the late nineteenth and twentieth centuries and for sub­stantially similar reasons. Both traditions are concerned primarily with the analysis of the establishment of ‘natural’ or as we would now say ‘equilib­rium’ prices. The emphasis on the final state rather than the process of getting there inevitably diverts attention from the distinctive contribution of the entrepreneur. Classical analysis recognised the role of superinten­dence and organisation in economic life; and, in the conditions prevailing in the late eighteenth century, the provider of capital and the organiser of production would usually be the same person. The result was a tendency to muddle together some sources of income and to reduce the significance of others. Thus the English classical writers, including Smith, Ricardo and Mill3, used the word ‘profit’ to describe the total return to the provider of capital even though this included elements which might more properly be termed ‘wages of management’, ‘interest on capital’, ‘monopoly rents’, ‘windfalls’ and so forth.

It is to the French classical tradition we must look for the origins of the idea that profit is a type of income quite distinct from that received by capital and that it goes to the entrepreneur. The early French contribution in this field is perhaps appropriately reflected in the fact that English econ­omists have come to use the French word ‘entrepreneur’ rather than any English equivalent such as the term ‘venturer’. J.B. Say4, building on the ideas of Cantillon5, insisted that profit was a quite separate category of income from interest, thus establishing the major distinction between English and French classical schools in this area. He did not, however, emphasise as did Cantillon the importance of risk, but initially viewed profit as a wage accruing to the organiser of production. Within the French school, Knight (1921) in his review of theories of profit (p. 25) singles out Courcelle-Seneuil6 as the contributor who most clearly argued that profit was a reward for the assumption of risk and was not in any sense a wage.

Mention should also be made of the German, and especially Von Thünen’s, contribution. Von Thünen7 is most well known for his work on transport costs, land rents and the consequent spatial pattern of agricul­tural land-use around a city. In the development of this analysis, Von Thünen, who is said to have made extensive use of the financial records of his own estates, defines profit as a residual after the expenses of interest, insurance, and the wages of management have been met. By seeing profit as essentially a return for bearing uninsurable risks, Von Thünen was a close forerunner of Knight.

2. Knight

Knight (1921) developed and elaborated the view that entrepreneurs receive a return for bearing uncertainty. His major criticism of theory up to that date was that even those who appreciated the importance of uncertainty were unclear about its nature and implications. One view, for example, was that profit arose from the fact of continuous change and development over time. Knight did not dispute that radical and less radical changes occurred continuously and that these could give rise to uncertainty and hence to profits, but he insisted that change per se was not the issue. If, for example, changes occurred, the consequences of which were entirely foreseeable, no profits would be generated. Perfectly foreseen changes were compatible with a state of ‘equilibrium’ in which no profits would appear. This type of equilibrium is now often referred to as a ‘Hayekian’ equilibrium.8 Change occurs, but, because it is perfectly foreseen, no expectations are ever disap­pointed. A famous example is that of an approaching meteorite, the impact of which will occasion many changes but all of which may be calculated perfectly accurately (fire damage, loss of crops and buildings, and so on). The actual collision of the meteorite with the earth will then produce no profits, since the prices of resources have long since adjusted to the cer­tainty of the impact and nothing unforeseen has occurred. Correct expec­tations, however, are critical to this result. No change is compatible with profits and losses if change is confidently expected. If contrary to all known laws of physics the meteorite suddenly veers away from the earth and dis­appears into outer space, people will have to adjust to this unexpected con­tinuation of the status quo, expectations will have been disappointed and profits and losses will appear.

According to Knight, therefore, it is not change but uncertainty and the possibility of incorrect expectations which give rise to profit. Further, the term uncertainty is used by Knight only to describe circumstances in which reliable probability values cannot be attached to possible future outcomes. Where future events can be assigned probabilities, Knight uses the term ‘risk’ to describe the situation, and argues that the existence of insurance markets will often enable people to avoid such risk.9 True uncertainty cannot be avoided by paying an insurance premium. Important conse­quences follow from the recognition that much of economic life represents a response to the existence of uncertainty. As Knight puts it: ‘With Uncertainty present doing things, the actual execution of activity becomes in a real sense a secondary part of life; the primary problem or function is deciding what to do and how to do it’ (p. 268).

This ‘primary function’ is the entrepreneurial function. The job of decid­ing how various objectives are to be achieved and of predicting what objec­tives are worth achieving devolves on the entrepreneur, a specialist who is prepared to bear the costs of uncertainty. ‘The confident and venturesome assume the risk or insure the doubtful and timid by guaranteeing to the latter a specified income in return for an assignment of the actual result’ (pp. 269-70).

It is not clear from this that the confident and venturesome should employ the doubtful and timid, or that the entrepreneur should own and organise ‘a firm’, although Knight appears to have thought that this neces­sarily follows. This point will be taken up later on. For present purposes, however, the main result of Knight’s analysis is that the entrepreneur’s function is to make judgements about the uncertain future, and the reward associated with this function ‘profit’ is a return to uncertainty bearing. ‘It is our imperfect knowledge of the future … which is crucial for the under­standing of our problem’ (p. 198).

This ‘Knightian’ view of the entrepreneur has a powerful intuitive appeal. Clearly the exercise of entrepreneurship is usually associated with uncertainty bearing and has something to do with imperfect knowledge, but there are equally powerful objections to it. As Schumpeter emphasised (1954, p. 556) if a person is to make a profit from uncertain turns of events, that person will do so as the owner of some marketable resource. Thus, uncertainty is borne by resource owners who have to accept the conse­quences for the value on the market of their resources of unexpected change. If we are prepared to define resource owners as ‘capitalists’ (that is, we suppress the distinctions between land, natural resources, buildings, physical and human capital) then uninsurable risks must be borne by cap­italists. It may be objected that the entrepreneur may borrow from the cap­italist at a fixed interest and that, if so, the entrepreneur not the capitalist bears the risk, but in this case, if the entrepreneur has no independent resources, failure of the enterprise must mean default on the loan and the capitalist is an uncertainty bearer. Where the entrepreneur has other resources which permit repayment of the debt in the event of failure, clearly the entrepreneur bears the uncertainty but only in so far as he or she is also a capitalist.

If uninsurable risk is borne by resource owners, what becomes of the dis­tinctive contribution of ‘the entrepreneur’? Modern analysis of the entre­preneur reverts to and develops Say’s insight that the organisation of production, the combining together of resource inputs, requires skills of a different order than those of routine labour. Knights’ ‘primary function’ of ‘deciding what to do and how to do it’ is indeed an entrepreneurial activ­ity, but it is conceptually a quite separate activity from bearing uninsurable risks, even though the latter may be associated with it. Recent theory there­fore emphasises that the entrepreneur does not merely put up with the con­sequences of imperfect knowledge, but rather reaps the rewards of discovering and using new knowledge. The English tradition, in which industry appears to ‘run itself’ using inputs of capital and labour in an apparently routine and ‘mindless’ fashion, failed to recognise the impor­tance of the coordinator; the person who decides how and what things shall be done as distinct from the person who merely ensures that they are done. In the next section a more careful elaboration of this view of the entrepreneur is attempted using the work of the most influential modern theorist in the subject, Israel Kirzner.

3. Kirzner

In Chapter 1 we spent some time on the elementary task of showing how a community of four individuals (A, B, C and D) could gain through spe­cialisation and exchange. Each person, we assumed, faced different ‘pro­duction possibilities’. If each remained independent and isolated, their consumption patterns would be those of Table 1.1. Total output would be eight units of x and eight units of y. Specialisation, it was found, enabled total output to rise to ten units of x and ten units of y. Agreements to spe­cialise in the appropriate way and distribute the output as in Tables 1.4 and 1.5 represented various ‘solutions’ to the exchange game (the ‘core’). We further saw that the establishment of appropriate prices for x and y would induce people to specialise and exchange their output on the market, and that the final position would be equivalent to a ‘core’ solution. At each stage it was emphasised that the theory as presented did not provide a persuasive account of the exchange process. ‘Core’ solutions could be calculated and equilibrium prices deduced once all the information necessary had been acquired, but there was only a limited discussion of how and to whom this information is made available.

In Edgeworth’s bargaining approach, the ‘core’ solution emerges after a process of haggling involving all the contractors in the market. In Walras’s ‘tatônnement’, the solution emerges from a trial and error sequence of price-setting by the auctioneer. In both cases it is assumed that no final deci­sions concerning the allocation of resources are made until the bargaining or tatônnement processes are completed. As we have seen, this is paradox­ical, for it effectively implies that the information problem has to be com­pletely solved before the stage of actual resource allocation can begin, and yet this solution of the information problem apparently requires no resources itself and hence involves no opportunity costs. These two ideas that contractors in the market must all be fully informed before anything of consequence can happen, and that the achievement of this state of full information does not in itself imply that anything of consequence has already happened, should not be rejected simply on the grounds that they are ‘unrealistic’. Within the framework of general equilibrium theory, and given the objectives and purposes of that theory, they may be perfectly defensible. Further, it is by considering the conditions imposed by the requirements of general equilibrium that the role of the entrepreneur is perhaps most easily appreciated.10

The entrepreneur is central to the process by which contractors come to perceive the opportunities presented by specialisation and exchange. Clearly, if everyone already knows the production-possibility curves and preferences of each contractor, and if everyone is a rational, calculating, individual, one of the suggested solutions of Tables 1.4 and 1.5 will be adopted, but if each person is only partially informed, the acquisition of knowledge about potential gains from exchange becomes the pivotal eco­nomic problem. It is at this point that the ‘Austrian’ tradition in economic theory, here represented mainly by the work of Kirzner, emphasises the role of the entrepreneur. Essentially, an entrepreneur is any person who is ‘alert’ to hitherto unexploited possibilities for exchange. Spotting such possibili­ties enables the entrepreneur to benefit by acting as the ‘middleman’ who effects the change.

3.1. The entrepreneur as a ‘middleman’

Consider once more the four individuals in Chapter 1. The production possibilities facing each were assumed to be:

A: 10 – 4x=y

B: 12 – 3x=y

C: 6 — 2x=y

D: 3 — x = 2y.

Suppose now that another individual (an entrepreneur E) turns up on the scene (there is yet another shipwreck). This individual is washed up with no resources to his or her name but, being alert to new opportunities, s/he rapidly observes that the four inhabitants s/he meets would be much better off if they specialised along the lines we have already discussed and thus coordinated their activities. It will be recalled that the ‘no trade’ consump­tion pattern was as shown in the left-hand side of Table 3.1. Imagine that E first contacts A and persuades A to provide him/her with eight units of y in exchange for two units of x. Person A will just be prepared to do this since by specialising in y production (ten units) and accepting E’s offer, his or her final consumption level of two units of each commodity will be unchanged. E then approaches person B and suggests that the latter should provide one unit of x in exchange for three units of y. Once more, if person B specialises in x production (four units) and accepts E’s offer, his or her final consumption level of three units of each commodity would be unchanged. These are, of course, very ‘hard bargains’ and we might expect in general A and B to benefit from their acquaintance with E. For the moment, however, we will assume simply that E offers to purchase y at a price of \x (that is, sell x at price of 4y), and sell y at a price of |x (that is, purchase x at a price of 3y) and we have seen that this would leave A and

B no better or worse off than before. Person C on the other hand, will benefit noticeably from trade with E assuming he or she faces the same price ratio as B. By specialising in x (three units), C can trade 0.75 units of x for 2.25 units of y. Similarly, person D can specialise in x (three units) and through trade achieve the same position as C. The argument is precisely the same as was presented in Chapter 1 and the entries on the right-hand side of Table 3.1 are a combination of the first two lines of Table 1.5.

The important difference is that, whereas we assumed in each line of Table 1.5 that every contractor faced the same price ratio set by an auctioneer, in Table 3.1 we have assumed that contractors B, C and D faced a different price ratio from contractor A, and that these price ratios were negotiated with an entrepreneur. The entrepreneur in this simple situation acts merely as an intermediary. By spotting that currently unexploited gains from trade existed, the entrepreneur was able to use that knowledge both to realise the gains and to appropriate a proportion of them for him or herself. Each person’s individual benefit (in terms of units of x and y) from the realloca­tion of resources initiated by the entrepreneur is recorded in parenthesis in the last column of Table 3.1. Person D appears to have gained most of all, but this, of course, is the result of our simple assumption that E offered the same terms of trade to all sellers of x. If the entrepreneur could have kept B, C and D apart and negotiated individually with them, then a sufficiently ‘hardnosed’ approach might have diverted a larger proportion of the gains from trade in E’s direction. However, in the particular arithmetical example presented, the entrepreneur gains 0.5 units of x and 0.5 of y. Of the units of y supplied by person A, 7.5 units are in total given to B, C and D; while of the 2.5 units of x supplied altogether by B, C and D, only two units are given to A.

Person E achieves a consumption level of 0.5x and 0.5y. This represents ‘pure entrepreneurial profit’. It has nothing to do with interest payments on capital employed. Indeed, it has nothing to do with a return to any ‘factor’ as conventionally defined. It arises out of entrepreneurial activity and the possession of a particular kind of knowledge: the knowledge that oppor­tunities exist which no one has spotted before. The entrepreneur acts as a coordinator of resources, and his or her profit is taken from the gains in efficiency which accompany his/her activity. Note that the gains to persons C and D recorded in Table 3.1 are not entrepreneurial profits even though they also are part of the efficiency gain resulting from the reallocation of resources. Persons C and D did not spot the potential benefits available from change; they merely responded to the entrepreneur’s offer. Their gain is therefore a type of ‘windfall’; a portion of the increased output which eluded the grasp of the entrepreneur.

An important problem was studiously ignored in the paragraphs above, however. Clearly the entrepreneur does not negotiate simultaneously with all the people playing a part in his or her plans. Had s/he done so, s/he would be unable to offer person A different terms of trade from the others, and the mechanism which enables some profit to be realised would be ineffective. In effect, the entrepreneur’s knowledge would be instantly avail­able to the others and no entrepreneurial profit could therefore be derived from it. As Richardson (1960, p.57) puts it: ‘a general profit opportunity, which is both known to everyone and equally capable of being exploited by everyone, is, in an important sense, a profit opportunity for no one in par­ticular’. On the other hand, if the entrepreneur, who it will be recalled has no resources of his or her own, is to trade with A first, where is s/he to find the units of x required to make an offer? One solution to this problem might involve E persuading person A to supply the eight units of y in advance of E’s delivery of the x. In this case, person A would be acting as a capitalist supplying the entrepreneur with the resources required to test his or her hunch in the market. We would then expect that person A would require compensation both for the delay and for the perceived uncertainty associ­ated with the ultimate delivery of x. Person A would have to take the risk that E’s confidence in her ability to deliver a given quantity of x bya certain date is misplaced and that the entrepreneurial plan might fail. For Knight, as we have seen, this bearing of uncertainty would make person A an entre­preneur, but for Kirzner this is not necessarily the case. E is the person who thinks s/he has spotted new opportunities, and it is E who stands to gain pure entrepreneurial profits if this judgement proves correct. Kirzner (1979) is quite specific on this point: ‘Entrepreneurial profits …are not cap­tured by owners, in their capacity as owners, at all. They are captured, instead, by men who exercise pure entrepreneurship, for which ownership is never a condition’ (p. 94). Where time must elapse between purchase and sale: ‘It is still correct to insist that the entrepreneur requires no investment of any kind. If the surplus …is sufficient to enable the entrepreneur to offer an interest payment attractive enough to persuade someone to advance the necessary funds… the entrepreneur has discovered a way of obtaining pure profit, without the need to invest anything at all’ (Kirzner, 1973, p. 49).11

Of course, nothing guarantees that the penniless entrepreneur will succeed in persuading capitalists to advance their funds, but, as was empha­sised in Chapter 2: ‘These costs of securing recognition of one’s compe­tence and trustworthiness are truly social costs. They would exist under any system of economic organisation’ (Kirzner, 1979, p. 101). All that can be said is that an entrepreneur who is also a resource owner will find it easier to back his or her hunches and benefit from his or her knowledge since these transactions costs can be avoided.12 The entrepreneur lends to him or herself.13

3.2. Entrepreneurship and knowledge

For Kirzner, the entrepreneur is the person who perceives the opportunities and hence benefits from the possession of knowledge not apparently pos­sessed by others. Thus entrepreneurship is central to the process by which information is disseminated throughout the economy. This emphasis on process is distinctive of the modern ‘Austrian’ school of thought, and it was noted in Chapter 1 that the major Austrian criticism of neoclassical equi­librium theory is the implicit assumption of perfect knowledge which underlies it.

It would, however, be totally misleading simply to leave the impression that neoclassical theory has nothing of substance to say about the problem of information. Our discussion of the information requirements underly­ing general equilibrium theory still stands, but neoclassical economists have developed tools which enable them to handle some problems in the eco­nomics of information very effectively. Since the early 1960s, the assump­tion of ‘costless knowledge’ has been dropped and replaced by the idea that knowledge is a valuable good which it is costly to acquire. This opens the way to the study of the behaviour of rational, maximising, calculating agents in the field of acquiring knowledge. A good example of such an approach is the analysis of ‘search behaviour’, a literature which emanates from Stigler’s (1961) paper on the economics of information. Essentially the idea is simply that resources will be invested in search (that is, acquir­ing information) up to the point at which the marginal expected benefits of the information thus obtained equal the marginal costs of obtaining it.

It is worth taking a brief look at the elements of this literature in order to draw out the contrast between the approach of the Austrians and that of ‘standard’ theory to the problem of information. Take the very simplest case discussed by Stigler (1961). Suppose that a person wishes to purchase a certain commodity. In the textbook world of perfect competition, the price of this commodity is known with certainty by every transactor. But in reality, of course, the price quoted by some sellers will be higher than others, and consumers will usually benefit by ‘shopping around’. Imagine now that the consumer knows something about the probability of being quoted a price within certain ranges upon any given enquiry. Indeed, suppose that s/he knows that the distribution of quoted prices is rectangu­lar and that the price quoted may vary between £0 and £1. Such a person might argue that if s/he simply plans to buy from the first person s/he con­tacts s/he will expect to pay a price of £0.5, but, of course, there is a 0.5 probability that s/he will pay a price greater than £0.5. If s/he obtains two price quotes and then accepts the minimum of these two prices, the person will reduce the expected price that s/he pays. The probability of paying more than £0.5 for the item (that is, that both quotes turn out to be greater than £0.5) will be reduced to 0.25, while the probability that at least one of the two quotes is less than £0.5 would be 0.75. Clearly, ‘shopping around’ favourably affects the probability distribution of the minimum price encountered. It can be shown in this case14 that if n is the number of price quotes obtained, the mathematical expectation of the price paid will be 1/(n + 1).

If we now assume that the person intends to buy a quantity q of the com­modity (which for simplicity we take as independent of the minimum price quoted) and that the extra cost of getting one more price quote is a con­stant c, it is a routine minimisation problem to calculate the number of searches which will minimise the expected total cost (expenditure plus search costs) of buying q units of the commodity. The expected total cost E(T) of buying q units of the commodity will be

E(T) = [q/(n +1)] + nc.

The value of n that minimises this expression is given by

n* = (q/c) -1.

If, for example, the person is to purchase a single unit of the commodity (Stigler uses once more the example of a second-hand car!) and if c =1/9 then the first-order condition tells us that two price quotes will be optimal. Clearly the lower the cost of search c, the greater the optimal number of searches (if c = 1/100, n = 9). Not surprisingly, the more units of the com­modity to be purchased, the greater the optimal search effort (with q =100 and c = 1/100, n = 99).

Stigler’s analysis therefore involves the searcher in calculating the best sample size of price offers to choose. Once the problem is solved, the searcher goes out into the market and contacts the requisite number of sellers. This predetermined-sample-size strategy, however, clearly has its dis­advantages. If q = 1 and c = 1/9 and the buyer approaches the first of the two sellers s/he is going to sample and finds that s/he is offered a price of £0.1, it is not at all clear that the person should bother to contact anyone else. Rather than a predetermined sample-size strategy, therefore, it has been suggested that a sequential-search strategy would be more sensible. At each stage, the searcher asks whether, given the known frequency distribu­tion of price offers and the lowest price so far encountered, a further unit of search would be worthwhile.

Assuming that q= 1 and the frequency distribution is rectangular, as above, it is a simple matter to show15 that further search at any stage will not be worthwhile (in terms of reducing expected costs) if

If the most recent price offer is less than V(2c) it is best to accept it and search no more. V(2c) is called a ‘reservation price’. The sequential-search strategy therefore comes down to the calculation of the optimal reservation price. Search then continues until a price below this level is quoted.16

This brief excursion into neoclassical search theory is sufficient at least to uncover the essential rationale of the approach. Neoclassical theory is evidently quite capable of analysing some types of search, but, if this is so, what is it that distinguishes Kirzner’s ‘alert’ entrepreneur who discovers new information from Stigler’s searcher after new information, and is the distinction of any importance?

The idea of a rational investment programme in the acquisition of new knowledge, as suggested by neoclassical search theory, is in some respects rather odd, for it implies that it is possible to estimate the value of new knowledge in advance of its discovery. Presumably, though, this will only be possible if, in some sense, we already know what we are looking for along with the probability of finding it. It is rather as if we are searching for some­thing of which we once had full knowledge but have inadvertently mislaid. Stigler’s searcher decides how much time it is worth spending rummaging through dusty attics and untidy drawers looking for a sketch which (the family recalls) Aunt Enid thought might be by Lautrec. Kirzner’s entrepre­neur enters a house and glances lazily at the pictures which have been hanging in the same place for years. ‘Isn’t that a Lautrec on the wall?’17

For Kirzner, entrepreneurial knowledge is not the sort of knowledge which is the yield to a rational investment policy in search. Entrepreneurial knowledge does not involve resource inputs but is ‘costless’. It arises when someone notices an opportunity which may have been available all along – something which was staring everyone in the face but had somehow escaped their attention (Kirzner, 1979, pp.129-31).18 ‘Why didn’t I think of that?’ is the exasperated cry of most of us when confronted with some simple and effective piece of enterprise. Part of our exasperation derives from the appreciation that we may have possessed all the individual pieces of infor­mation required to perceive the same opportunity. The significance of the information somehow unfortunately escaped us, and we failed to ‘put it together’ to form a coherent and profitable picture. Such self-admonish­ment is quite out of place in the world of neoclassical search. For in that world there are no mistakes and regrets whether deriving from omission or commission. True, a decision having been made, a person may later acquire knowledge which reveals how much better some alternative decision might have been, but, providing that within the context of the knowledge available at the time the decision was correct and providing that investment in infor­mation had been carried to the optimal point, no real ‘error’ can be said to have occurred. The neoclassical world must always ultimately be a world of calculation in which ‘observation’ is taken for granted.

4. Schumpeter

4.1. Entrepreneurship and equilibrium

As we have seen, Kirzner’s approach to the entrepreneur is that s/he is alert to hitherto unexploited gains from trade. At any one time, economic life consists of a complex pattern of exchange relationships. The entrepreneur acts as the catalyst which loosens some transactional bonds and forges new ones. In our simple example above, the entrepreneur was the motive force impelling society towards some ultimate ‘solution’ represented by the figures in Tables 1.4 and 1.5. Once this position is achieved, no further pos­sible entrepreneurial profits are available. By definition, if the allocation of resources is a ‘core’ allocation, no reallocation can benefit any group of people, and hence all the ‘alertness’ in the world will be of no avail in the spotting of further efficiency gains. Thus, for Kirzner, entrepreneurship is associated with disequilibrium, and concerns the process by which the economy moves towards equilibrium.

The very notion of the entrepreneur as a trader and middleman suggests the gradual and incremental approach to equilibrium, as differences in rel­ative prices are spotted and arbitrage takes place. As Loasby (1982) empha­sises, there is a similarity here with Marshall’s19 approach to economic change, with its emphasis on numberless small modifications of established procedures tested out in the marketplace by ‘the alert businessman’. It is the tradition of Mises and of Hayek, who both emphasise the small-scale and ‘local’ character of much entrepreneurship and the dependence of this entrepreneurship on ‘knowledge of time and place’ or ‘tacit knowledge’20 which ‘by its nature cannot enter into statistics’ (Hayek, 1945, p.21).

There is, however, a more heroic conception of the entrepreneur than this. It is easy to see how Kirzner’s approach might lead to the conclusion that virtually everyone acts entrepreneurially, at least to some degree. For Schumpeter (1943), on the other hand, the entrepreneur is an extraordinary person who brings about extraordinary events. In Schumpeter’s view, the entrepreneur is a revolutionary, an innovator overturning tried and tested convention and producing novelty. Such boldness and confidence ‘requires aptitudes that are present only in a small fraction of the population’ (p. 132) and represents a ‘distinct economic function’. This function of the entre­preneur ‘is to reform or revolutionise the pattern of production by exploit­ing an invention or, more generally, an untried technological possibility for producing a new commodity or producing an old one in a new way, by opening up a new source of supply of materials or a new outlet for prod­ucts, by reorganising an industry and so on’ (p. 132).

In order not to give a misleading impression, it should be added that Schumpeter is careful to say that railroad construction or the generation of electrical power were ‘spectacular instances’ and that his conception of entrepreneurship would include introducing a new kind of ‘sausage or toothbrush’. The crucial characteristic is not scale as such but novelty in a technological sense. New products, new processes or new types of organi­sation are thrust upon the world, often in the face of violent opposition. The military analogy with generalship or the ‘medieval warlords, great or small’ (p. 133) is considered by Schumpeter most appropriate because of the importance of ‘individual leadership acting by virtue of personal force and personal responsibility for success’.

Because of this emphasis on the energy of individual entrepreneurs and the introduction of new products and processes, Schumpeter sees entre­preneurship as a disruptive, destabilising force, responsible for cycles of prosperity and depression. He refers explicitly to the disequilibrating impact of the new products or methods’ (p. 132, emphasis added). This is clearly a rather different conception from that of Kirzner. Kirzner’s entrepreneur is engaged in spotting ways of making the best of a given set of technical cir­cumstances. Technology, the state of the arts, of skills and scientific knowl­edge, are a backdrop to, rather than the outcome of, entrepreneurial activities. The possibilities for the full use of available resources in given technical circumstances still have to be uncovered, but this is all the entre­preneur is seen as doing. The production-possibility curves applying to each of the four contractors in our arithmetical example were drawn on the assumption that they represented the outer bound of all the possible points attainable in the prevailing state of technological knowledge available to each individual. These curves would alter with changes in scientific and technical information. Thus, it is usual to contrast Kirzner’s approach with Schumpeter’s by arguing that Kirzner’s entrepreneur will get us to point A in Figure 1.1, a point on the community-production-possibility frontier representing a given state of technical knowledge; while Schumpeter’s entrepreneur is engaged in shifting the production-possibility frontier by instituting innovations. It is in this sense that Kirzner’s entrepreneur gets us to an equilibrium point A, while Schumpeter’s entrepreneur disturbs this position of equilibrium by redefining the technical constraints.

This distinction between two types of entrepreneur, one an equilibrating and the other a disequilibrating force, is not, however, as clear-cut as at first sight it might appear. It would seem inconsistent with Kirzner’s basic phi­losophy to assume that people are aware of all the purely technical possi­bilities available to them, and that their lack of knowledge concerns only the possibilities of benefiting through the process of exchange. Rather, con­sistency requires us to argue that each person will have limited technical knowledge; that over time, whether by accident or design, they will acquire additional technical knowledge; and that entrepreneurial perception will be as significant in appreciating the consequences of newly acquired technical knowledge as knowledge of price differentials or any other objective pieces of information. It is difficult to see why the person who, upon becoming acquainted with the properties of some artificial fibre, realises that, using this fibre, toothbrushes might be made more cheaply than with natural fibre, is acting as a Schumpeterian rather than Kirznerian entrepreneur. If, ultimately, it is the perception of opportunities which defines the entrepre­neur, then Kirzner’s framework must surely embrace the marketing of new sausages and toothbrushes.

To maintain the distinction between equilibrating and disequilibrating entrepreneurs therefore seems to require that we distinguish between the use of ‘new’ technical knowledge, and the new use of technical knowledge which has been known for some time – at least, known to some people. Thus, the person who first manufactures and uses an artificial fibre is Schumpeterian, whereas the people who gradually come to perceive the multifarious possible applications are Kirznerian. This distinction would certainly seem consistent with the ‘flavour’ of the two writers, but whether it is tenable is a question which, for the present, we will simply ignore.

4.2. The fate of the entrepreneur

Given the differences in emphasis between Schumpeter and Kirzner it is somewhat surprising to find one strand of thought common to both.

Entrepreneurial activity serves to render obsolete the entrepreneur. This is perhaps easier to understand in the case of Kirzner since we have already drawn attention to the fact that, as advantage is taken of the available opportunities, the approach of equilibrium reduces the scope for further entrepreneurial insights. Kirzner, it should be emphasised, does not explic­itly predict the demise of the entrepreneur, as does Schumpeter. Exogenous changes in tastes and technology can be relied upon to create continuous opportunities for entrepreneurship, but it does appear to be a characteris­tic of Kirzner’s system that it would run down in the absence of these outside forces.

In the case of Schumpeter, the idea of the entrepreneur as a destabiliser might suggest the conclusion that the commercial exploitation of new inventions could go on indefinitely and with it the distinctive role of the entrepreneur. Schumpeter, however, took a quite different view. The progress of capitalism, he asserted, would eventually reduce the impor­tance of the entrepreneur. The entrepreneur was required initially to over­come resistance to change but now ‘innovation itself is being reduced to routine. Technological progress is increasingly becoming the business of teams of trained specialists who turn out what is required and make it work in predictable ways’ (p. 132). ‘Economic progress tends to become deper­sonalised and automatised’ (p. 133). The giant industrial unit ‘ousts the entrepreneur’ and the specialist instigators of progress eventually receive ‘wages such as are paid for current administrative work’ (p. 134).

This conception is, of course, quite alien to Kirzner since ‘alertness’ to new opportunities could hardly be ‘depersonalised’ as envisaged by Schumpeter. Clearly ‘perception’ as such is not considered by Schumpeter to give rise to any special problems. Progress derives from technological change, and this can apparently develop a momentum of its own. The entrepreneur is required to galvanise the economic system into motion after which, like some material object in Newtonian physics, all resisting forces having been removed, it continues indefinitely along its predicted path.21

Schumpeter’s prediction of the obsolescence of the entrepreneur is one of the most celebrated aspects of his work. It has naturally attracted con­siderable critical attention which cannot be considered here in detail. However, the different interpretations that can be placed on similar obser­vations are startling in the study of economics. Whereas large corporate entities in Schumpeter’s view ‘oust the entrepreneur’, Kirzner sees them as magnets attracting entrepreneurial talent. The corporation is ‘an ingenious, unplanned device that eases the access of entrepreneurial talent to sources of large-scale financing’ (1979, p. 105). It reduces the transactions costs, considered earlier, involved in gaining access to capitalists’ funds.22 Schumpeter observes the large corporation and finds in it an environment unconducive to the survival of the entrepreneur. Kirzner observes the same phenomenon and pronounces it a structure which has evolved to permit a more effective use of entrepreneurial alertness. Further, Schumpeter’s view that, within the corporation, technical progress becomes automatic was greeted even at the time with scepticism if not disbelief in some quarters. One such critic (Jewkes, 1948), with barely concealed contempt for a view which so contradicted his experience of aircraft production and research during the Second World War in the UK, wrote that: ‘Left to themselves, and having no particular reasons for taking risks, teams of technicians will almost invariably bog themselves down without direction or purpose . . . Take away the motive force of innovation – the business man – and the cau­tious and conservative habits of the consumer and technician would roll back over us with deadening effect’ (p. 21).

5. Shackle

Shackle is celebrated not so much for his specific views on the entrepreneur as for his writing on the nature of choice in general. A brief consideration of his work is relevant here, however, because it impinges directly on the issues under discussion. We have seen how, in Kirzner’s conception of things, the entrepreneur perceives opportunities. Shackle would insist that the entrepreneur must imagine these opportunities. This is not to use the word ‘imagine’ in the popular sense, to imply that the opportunities are illu­sions, but rather simply to assert that any choice involves the exercise of imagining a possible future state of affairs. We do not choose between ‘facts’ or ‘certainties’. When we act, it is on the basis of the imagined con­sequences. Even in the simplest possible case of choosing between a loaf of bread and a pint of beer we must choose on the basis of how we imagine our feelings will be when we actually get round to eating the bread or drink­ing the beer. This is so even if this occurrence follows the act of choice by a mere fraction of a second. Once we have drunk the beer we know what it was like, but this information is not relevant to any act of choice. When we approach the bar for our second pint we will of course, remember the expe­rience of drinking the first, but it is strictly not this recollection of past drinks which determines our choice, but the consequent expectation of the future drink. Thus, Shackle is the complete subjectivist. All neoclassical economists are subjectivists in the sense that they accept that my valuation of beer in terms of the amount of something else I am prepared to sacrifice to get another pint is simply an expression of my own subjective prefer­ences. Shackle, however, goes further to argue that the so-called ‘objects of choice’ (the bread and the beer) are not objects at all, but subjective impres­sions about the future: ‘If my theme be accepted, there is nothing among which the individual can make a choice, except the creations of his own thought’ (1979b, p. 26).

Such radical subjectivism has immediate implications for his view of the entrepreneur. Shackle, like Kirzner, is often placed within the group of writers called ‘Austrian’, yet the implications of his philosophy are so sub­versive that by comparison the main Austrian camp appears a haven of conservatism. Kirzner’s entrepreneur gradually uncovers the opportunities presented by objectively given constraints. In the process, s/he takes us to an equilibrium in which all opportunities are finally exploited. If, though, opportunities do not have an objective existence independent of their dis­coverer, but spring rather from each person’s imagination, they cannot be recorded, even conceptually, in a finite list unless the human imagination itself is capable of exhaustion. Thus, for Shackle there can be no state of ‘full coordination’; no equilibrium representing the final resting place of the economy. Underlying Kirzner, there is a form of historical determin­ism, a final destination. Underlying Shackle, there is simply ‘the anarchy of history’ (1979b, p.31). His view is clearly anti-determinist, and rejects the whole concept of equilibrium.23

To refer once more to our arithmetical illustration of the four individu­als, Kirzner’s entrepreneur spots the possibilities presented by four different, but objectively existing, production-possibility curves, but these constraints and the opportunities delimited by them are, for Shackle, simply what people at any given time think they are, and can never have the status of ultimate objective unchanging facts. Tomorrow, person A may imagine new ways of using his or her resources for different or for similar purposes. Either way, the established pattern is upset and new opportuni­ties for entrepreneurship are created. As Loasby (1982b) puts it: ‘Kirzner’s entrepreneurs are alert, Shackle’s are creative’ (p. 119). Shackle rejects con­ceptions of the economic process which ‘rule a line under the sum of human knowledge, the total human inventive accomplishment’ (Shackle, 1982, p. 225).

If comparisons are to be made, Shackle’s entrepreneur has perhaps a greater affinity with that of Schumpeter than with that of Kirzner. The emphasis on innovation on the one hand (Schumpeter) and the creative imagination on the other (Shackle) are closely related.24 Further, the ideas that entrepreneurial activity disrupts equilibrium on the one hand (Schumpeter) and denies the possibility of equilibrium on the other (Shackle), while clearly not formally compatible, nevertheless suggest a similar conception of its impact. In other ways, however, Schumpeter and Shackle are far apart. Neither Schumpeter’s view that only a small propor­tion of people have the qualities to be entrepreneurs, nor his view that the entrepreneurial function is doomed to extinction, seem compatible with

Shackle’s philosophy, for Shackle’s view is ultimately grounded on his response to the most fundamental question of what it really means ‘to choose’. Everyone faces the necessity of choice, and in choosing they exer­cise the entrepreneurial faculty of imagination. Thus, so long as there are human beings and choices to be made, so too will there be entrepreneurs. Shackle’s approach to the entrepreneur, therefore, is not part of a theory of business enterprise as commonly understood, nor is it like Kirzner’s frame­work an attempt to consider the process by which equilibrium is attained; rather it is an integral part of his whole approach to the theory of individ­ual choice.

6. Casson

To be told that entrepreneurship is an inevitable concomitant of the human condition, while important conceptually and philosophically, is light years away from the popular conception of the entrepreneur with which we started. In principle we may accept the case that all decisions are specula­tive, but we may also accept that some decisions are more speculative than others, and that some people are better at making these more speculative decisions than others. If the entrepreneur is ultimately to play a part within a theory of the firm, there seems no escaping the idea that the entrepreneur possesses skills which are special, if not in kind then in degree. Thus, Knight’s entrepreneur is unusually willing to tolerate uncertainty, Kirzner’s is especially alert, Schumpeter’s is ruthlessly capable of smashing the oppo­sition, and Shackle’s is endowed with a particularly creative imagination.

Casson (1982) attempts to synthesise and extend these conceptions of the entrepreneur. His definition is as follows: ‘An entrepreneur is someone who specialises in taking judgemental decisions about the coordination of scarce resources’ (p. 23). Central to this definition is the notion of a judge­mental decision. This, Casson defines as a decision ‘where different indi­viduals, sharing the same objectives and acting under similar circumstances, would make different decisions’ (p.24). They would make different decisions because they have ‘different access to information, or different interpretation of it’. It follows from this definition that an entre­preneur will be a person whose judgement inevitably differs from the judge­ment of others. The reward, then, for an entrepreneur derives from backing his or her judgement and being proved right by subsequent events.

A single chapter does not allow the space carefully to develop every point of comparison between Casson’s view of the entrepreneur and those which we have already encountered. Nevertheless a few observations may help to make clear the distinctive contribution of Casson and at the same time clarify some of the issues discussed in earlier sections.

Casson has one fundamental point of agreement with the ‘Austrian’ the­orists. The entrepreneur’s reward is a residual income not a contractual income, and it is derived from the process of exchange or ‘market-making activities’. For Casson, the middleman is an entrepreneur, just as for Kirzner. Entrepreneurs reallocate resources. To achieve such a resource reallocation they must trade in property rights (see Chapter 4) and if their attempts at coordination (that is, resource reallocation) are successful they will derive a pure entrepreneurial profit. The person who judges that a firm could be reorganised profitably, purchases the firm, changes its operations (by recontracting with the inputs) and sells it for a gain is clearly an entre­preneur. The person who thinks that a group of people, at present working independently, would be more effective as a team, and who forms the team by employing each at a wage equivalent to their existing income, thereby appropriates the productivity gains achievable through team effort. Such a person is also clearly an entrepreneur.25

Even at this level there are, of course, differences of emphasis. Thus, as we have seen, Casson insists that these ‘market makers’ or ‘coordinators’ are specialists, whereas Kirzner sees any alert person as a potential entre­preneur. Further, Casson emphasises that entrepreneurs require command over resources if they are to back their judgements and that this is likely to imply personal wealth. He refers to people with entrepreneur­ial ability but no access to capital as ‘unqualified’ (p. 333). Kirzner would accept that lack of personal capital presents extra transactional difficulties (see above section 2.3.1) but would almost certainly argue that anyone with entrepreneurial talent could never be in an objective sense totally ‘unqualified’. Entrepreneurial talent will find ways of securing control of resources, and ‘alertness’ to possible new ways of doing so is as much a part of entrepreneurial talent as alertness to possible new uses for the resources themselves.

In other respects, Casson and the Austrian theorists are far apart. For Schumpeter, Kirzner and Shackle, the ‘pace of change’ is determined by the activities of the entrepreneurs. Each had different ideas of the personal qualities which were important in instigating change, and they differed on the question of whether change thus instigated was disequilibrating or equilibrating, but each was clear that change and entrepreneurship go together like a horse and cart and that the entrepreneur is the horse. In Casson’s scheme, however, there is a tendency to view ‘the pace of change’ as an accompaniment to entrepreneurial activity rather than as its result. This makes Casson’s entrepreneur more akin to that of Knight – the person who, in an uncertain (changing) world, specialises in making difficult judge­ments and receives a profit for bearing uninsurable risk (Casson would say for exhibiting superior judgement). Indeed, Casson himself writes that his work on the entrepreneur is in many parts ‘simply a reformulation of ideas first presented by Knight’ (p. 373).

Casson’s entrepreneur perhaps requires Kirznerian perception to spot the information most pertinent to the judgemental decision at hand, and requires Shackle’s imaginative faculty to ponder future possibilities, but these features are not greatly emphasised. The reason is straightforward and fundamental. Casson wants to discuss more than the rationale of the entrepreneur. He realises the importance of the entrepreneur to the process of resource coor­dination and wishes to consider further questions upon which an economic theory might cast some light. Ultimately, Casson wishes to construct a pre­dictive theory of entrepreneurship. Shackle’s sublime epistemology is all very well but there are some important down-to-earth issues which it does not really help us to address. Why are some economic systems apparently more successful at resource reallocation than others? How are entrepreneurs ‘allo­cated’ to the task of making judgemental decisions? What institutional arrangements facilitate the exercise of entrepreneurship? What factors deter­mine the ‘supply’ of entrepreneurial talent? The language in these questions, which clearly implies the notion of entrepreneurship as a ‘resource’ similar to other factors of production to be ‘allocated’, is clearly alien to the Austrian conception. Yet, as Casson recognises early on (p. 9), ‘the Austrian school… is committed to extreme subjectivism – a philosophical standpoint which makes a predictive theory of the entrepreneur impossible’. He accepts that no predictive theory of the behaviour of an individual entrepreneur is possi­ble, but this does not rule out, he argues, a theory of the aggregate behaviour of a population of entrepreneurs. Fortified with this thought, Casson braces the hostility of the entire Austrian camp and proceeds to consider entrepre­neurship within a supply and demand framework.

Figure 3.1 reproduces Casson’s diagrammatic apparatus. The curves, though labelled DD’ and SS’ should not be regarded as supply and demand curves as conventionally interpreted. Consider the DD’ curve first. Along this curve is plotted the expected reward per entrepreneur as the number of active entrepreneurs increases. It is drawn assuming a given pace of change in the economy. Thus, new opportunities are cropping up at a certain pace and it is the task of the entrepreneurs to spot them and take advantage of them. Notice how similar to Kirzner’s is this conception. Whereas Kirzner sees the entrepreneur as gradually coming to perceive the opportunities latent in given circumstances, Casson sees the entrepreneur as spotting a certain proportion of the opportunities thrown up as circumstances change. It is as if Schumpeter’s entrepreneurial horse has done its work and the cart is proceeding under its own momentum directed not by teams of experts, as Schumpeter expected, but by specialist Kirznerian entrepreneurs placed in this slightly different environment by Casson.

As the number of active entrepreneurs increases, the expected return to each declines. This is to imply the usual competitive postulate in a slightly unfamiliar guise. The more active entrepreneurs there are, the more likely it is that any given opportunity will have already been spotted by someone else, and the length of time elapsing before a newly spotted opportunity is emulated by others is reduced. Thus the curve DD’ slopes downwards to the right and its position is dependent upon the pace of change. Curve SS’, on the other hand, is the supply curve of ‘qualified’ entrepreneurs (those with access to resources). It has a lower bound at the prevailing real wage on the reasonable grounds that no one will be an entrepreneur if the expected reward is below the wage rate. (This would, of course, not neces­sarily follow if entrepreneurs could be found who were risk lovers.) As the expected return to entrepreneurship rises above the wage rate, ‘qualified’ people desert employment to become specialist entrepreneurs. Further rises in the expected return induce yet others to become entrepreneurs who had before preferred leisure.

The position of the curve SS’ depends on the stock of entrepreneurial talent existing in the population (that is, the number of people who have the necessary judgemental qualities) and the proportion of these who are ‘qualified’ in Casson’s sense of having command over resources. People can become ‘qualified’ in three possible ways. They can have wealth of their own with which to pursue entrepreneurial ideas, they can have social con­tacts with wealthy individuals who know their character and appreciate their entrepreneurial potential, and they can gain command of resources from venture capitalists who do not know them but are specialists in screen­ing for entrepreneurial flair, or from holding a senior position in a corpo­ration.26 Thus, SS’ will shift with changes in the distribution of wealth, changes in social mobility, and changes in institutional mechanisms for screening for entrepreneurial ability.

Intersection of the two curves DD’ and SS’ gives an ‘equilibrium’ solu­tion and determines the number of active entrepreneurs (N) and their expected rewards (V). These long-run ‘equilibrium’ expected rewards to entrepreneurship Casson interprets as a form of wage. In the short run, the return to each entrepreneur is a return to superior (monopoly) knowledge. In the long run, the expected return is ‘simply compensation for time and effort, namely for the time and effort spent in identifying and making judge­mental decisions’ (p. 337). This long-run conception is not unlike that of J.B. Say which we encountered early on in this chapter. The number of active entrepreneurs (N), given the pace of change, will determine the pro­portion of new job opportunities that are exploited.

If this equilibrium or steady state is to be achieved, it is necessary that potential entrepreneurs should know the total number of entrepreneurs operating at any given time, along with the underlying pace of change of the economy. This information will be required if the expected return to entrepreneurship is to be assessed at each point and a decision made about the desirability of entry. It is worth noting that this mechanism will not nec­essarily lead smoothly to the equilibrium point. Suppose, for example, there were Nj active entrepreneurs, all those entrepreneurs in the interval NjN2 would be attracted to enter. Casson supposes that the numbers grad­ually rise and the expected reward gradually falls until equilibrium is achieved, but since we have not assumed that each entrepreneur knows the opportunity costs faced by other entrepreneurs, all we can really tell is that all entrepreneurs in the interval N1N2 will enter. Such a mass entry, however, will greatly depress the expected reward per entrepreneur (V2). In this way, a cycle could be generated not unlike the ‘cobweb cycle’ of prin­ciples of economics textbooks. It is also similar in conception to Schumpeter’s view of cycles of entrepreneurial activity. Clearly the deci­sion whether or not to become an entrepreneur is as ‘judgemental’ as the individual decisions the entrepreneur makes after he or she has entered. It would be ironical indeed to have to conclude that markets in general are coordinated by entrepreneurial activity, but that the market for entrepre­neurs requires an auctioneer.

Casson’s presentation of the market for entrepreneurs has many of the strengths which are associated with neoclassical ways of thinking. Drawing supply and demand curves can be a powerful aid to thought, which is presumably why they were invented and have proved so popular. Such diagrams immediately require us to specify what determines the shapes and positions of the curves, and in so doing we isolate what we think are the important influences on the market. Whether the concept of a steady-state equilibrium in the market for entrepreneurs is a theoretical advance, however, will have to be judged by future research effort. In prin­ciple the framework should permit comparative static properties to be deduced which are testable statistically. In practice it is not clear that some of the crucial variables in the model are amenable to statistical measure­ment; for example, the social and institutional factors behind the supply curve. Further, economists of the Austrian school would argue that the attempt to introduce the equilibrium method into studies of entrepre­neurship is fundamentally misconceived. For the DD’ curve to mean any­thing it has to be assumed that all entrepreneurs know ‘the pace of economic change’ and can calculate the expected rewards from their entre­preneurial efforts. There seems to be no very clear explanation as to why such a construction is likely to exhibit much stability. In the context of a firm’s market demand curve, Shackle (1982, p. 230) defends the geometri­cal figure as an aid to thought, but dismisses the possibility of knowing much about it in practice: ‘It is a mere thread floating wildly in the gale.’ There seems little possibility that he would take a different view of the demand for entrepreneurs.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

The entrepreneur and the firm

We have now considered in some detail the contribution of those economic theorists who have given the entrepreneur a place of importance in their thinking. There are clearly many different conceptions of the entrepreneur, but complex and subtle as are some of the arguments and distinctions that we have encountered, a few fundamental points seem particularly relevant for the theory of the firm. Entrepreneurs are concerned with the process of coordination. It is time and change which give rise to the possibility of entrepreneurial profit. This is so whether we emphasise alertness, imagina­tion, skill in making judgemental decisions, willingness to bear uncertainty or energy to overcome resistance as the ultimate source of entrepreneur­ship. In Chapter 1, much space was devoted to the proposition that a world of perfect information required no firms and that the firm was an institu­tional response to uncertainty. Clearly, therefore, at this very fundamental level the entrepreneur and the firm are closely associated. Both are con­cerned with managing change.

It is important to remember that the entrepreneur is not a prisoner in the firm, and it is interesting that most of the theoretical work in the Austrian school concentrates, as we have seen, on intermediation in the market. This, though, is simply because these theorists are particularly interested in uncovering the ultimate source of entrepreneurial profit, and wish to emphasise their view that all economic life can be seen as a changing network of exchange relationships. Thus, as we saw in Chapter 2, the firm itself can be viewed as a set of contracts with a central authority; contracts so framed as to reduce the transactions costs associated with exchange in the market. Those who spot that possibilities for profitable collaboration exist, who put together the necessary set of contracts between themselves and the collaborating agents, and thus establish a new enterprise or firm, are acting as entrepreneurs in the same way as the middleman in our earlier example. The difference between simple arbitrage and setting up a firm is one of the degree of complexity and the type of contract involved rather than any difference of economic principle. In both cases, resources are real­located, and if the reallocation makes it possible for everyone to benefit thereby the entrepreneur stands to gain a pure entrepreneurial profit. Entrepreneurship is central, therefore, to the establishment of new enter­prises. As Knight recognised, A considerable and increasing number of individual promoters and corporations give their exclusive attention to the launching of new enterprises, withdrawing entirely as soon as the prospects of the business become fairly determinate’ (p. 257).

1. The Small Entrepreneurial Firm

Casson’s supply curve of entrepreneurs is not easy to estimate empirically, but in recent years a significant amount of academic research has attempted to investigate the factors determining the establishment of new entrepreneurial firms. As we saw above, the position of the supply curve is expected to depend upon the prevailing return to normal employment, the judgemental qualities of the population and the information flows which they encounter, and access to finance (especially private wealth or some other means of surmounting the hazards associated with the capitalist- entrepreneur relationship).

In the United Kingdom, for example, Storey and Johnson (1987) have studied regional variations in entrepreneurship. The authors construct an ‘entrepreneurship index’ for each region. Factors such as the percentage of small firms in a region or the proportion of the population in managerial groups can be associated with the accumulation of relevant knowledge and experience on the part of potential entrepreneurs. Educational informa­tion, such as the proportion of the population with degrees, may correlate with the ability of people to draw up and act upon simple business plans. Savings per head, house price indices, and measures of the house-owning population are related to access to capital. Indices of ‘entry barriers’ attempt to measure the height of the capital ‘qualification’ which would have to be surmounted by new entrepreneurs in the industries of the region. Storey and Johnson find that variation between the regions in new firm for­mation rates in manufacturing cannot be explained simply by differing industrial structures. ‘Fertility’, that is the propensity for new firms to be set up, was a more important factor than regional industrial structure. Further, government policies to encourage small firm formation were not equally effective in all the regions.27 Indeed, ‘the entrepreneurship index performs remarkably well in predicting the regional take-up of small firms policy’ (p. 170).

In another paper, Storey and Johnson (1987b) concentrate on the oppor­tunity cost side of the decision to become an entrepreneur. They use an index of labour shedding as a measure of local labour market conditions. Unemployment may reduce the opportunity costs of becoming an entre­preneur although it may also have adverse effects on asset values – both human and physical. Using data on new manufacturing firms in Northern England between 1965 and 1978 they found that it was not the lure of high profitability that influenced new firm formation so much as local labour market conditions.28

Other work focuses on the entrepreneur’s problem of obtaining finance. In the United States, Evans and Leighton (1989) and Evans and Jovanovic (1989) find that switches out of employment into self-employment are more likely to occur as family assets rise. They argue that this evidence is consis­tent with the existence of binding financial constraints deriving from the contractual hazards discussed earlier. Blanchflower and Oswald (1991) also use a neoclassical supply and demand framework to investigate this issue. Data from the National Child Development Study in the United Kingdom were used to estimate the probability of a person being self-employed. The role of gifts or inheritance proved to be significant. ‘A gift or inheritance of £5,000 approximately doubles a typical young individual’s probability, ceteris paribus, of setting up his or her own business’ (p. 22).

At the macroeconomic level, King and Levine (1993) propose a model of the connection between finance, entrepreneurship and growth. In their work, financial intermediaries specialise in evaluating prospective entre­preneurs and thus overcoming the adverse selection problem. Some people have the capacity to be successful entrepreneurs and others do not. Expenditure of resources in the activity of assessment is capable of distinguishing competent from incompetent entrepreneurs. When compe­tent entrepreneurs and potentially successful projects have been identified, the intermediaries raise finance and diversify risk. They do this by provid­ing equity capital in the manner of venture capitalists. The hypothesis is that a more developed financial system will facilitate entrepreneurship and hence will be associated with higher levels of national income and with a higher growth rate. King and Levine use data from over eighty countries for the years 1960-89 to evaluate this hypothesis. Their proxies for the degree of development of the financial system include the ratio of liquid liabilities (currency plus interest and non-interest-bearing liabilities) to GDP, the rel­ative size of central bank assets to total bank assets and the proportion of credit advanced to private enterprises rather than the government sector. Econometric analysis indicated strong links between these characteristics of the financial system and overall productivity growth and growth in per capita output.

Small businesses rely heavily on the banks for the provision of finance. It has been argued that the problems of information asymmetry which con­front entrepreneur and financier will depend to some extent on the nature of the banking system. More discussion of this issue can be found at the end of Chapter 9 in the context of corporate governance. Here we merely note that the commercial banks in the United Kingdom have not traditionally been seen as appropriate sources of venture capital or long-term industrial investment.29 They developed as deposit-taking institutions that would provide relatively short-term finance on good security. No very deep knowl­edge of particular industries or entrepreneurial prospects was required in this system on the part of banks. Quite different traditions evolved in other countries with respect to the channelling of savings to investment opportu­nities. Stanworth and Gray (1991, p. 70) write that ‘by retaining a more sep­arate and isolationist role from industry in the UK, the banks may be less able to provide this transformation process than are their counterparts in other countries’. Entrepreneurs require appropriate supporting institutions. Reid and Jacobsen (1988, p.6) in their study of the small entrepreneurial firm in Scotland, however, do not see the policies of commercial banks as representing a significant barrier to the raising of outside finance by new small firms. Further discussion of the finance of the entrepreneur can be found in section 8 of Chapter 4, where the role of instruments of debt com­pared with instruments of ownership is specifically investigated.

Once the enterprise is established, the scope for entrepreneurship does not cease. In principle, the continuation of the same arrangements, the monitoring of the inputs and routine management are not entrepreneurial activities, but continuing change elsewhere is likely to involve a continuous process of adaptation by the firm, and no firm is likely to survive for long without the exercise of some entrepreneurial talent. It is in this context that Casson’s approach has most to commend it. All firms require entrepre­neurs, and a significant problem is how to make the most of available talent in the making of judgemental decisions. One of the most obvious problems faced by closely owned businesses is what happens when the sons or daugh­ters of the founder lack their parent’s business acumen. Indeed, the growth of transferable shares and limited liability can be seen as a response to this very problem.30

The entrepreneur and the proprietor of a business enterprise are not syn­onymous. Clearly, the single owner of the classical capitalist firm who sup­plies the capital and performs routine managerial tasks may also exercise entrepreneurial skills, but the owner is not necessarily an entrepreneur. Neither should we think in terms of a single entrepreneur associated with each firm. Especially in larger firms, entrepreneurship, alertness to new opportunities, may exist throughout the organisation. Indeed it is possible to regard ‘the firm’ itself as a ‘coalition’ of entrepreneurs.

2. The Firm as a Coalition of Entrepreneurs

Wu (1989) has developed the idea that markets can evolve over time to permit trade in commodities, labour and capital, but that ‘all the services provided by entrepreneurs are nontradeable’ (p. 103). Moral hazard is such a severe problem with entrepreneurial services that a market can never exist and ‘entrepreneurs must take the initiative to organise production through non market means, that is, by organising a firm’ (p.232). In the era of the medieval guilds, a craftsman would supply capital labour and entrepre­neurial talent together. Gradually, however, separate markets in labour and capital developed. For several centuries, entrepreneurship was still associ­ated with the provision of capital, but the development of the joint-stock company has permitted further specialisation. Pure entrepreneurs are taking control of production while capitalists become providers of funds. ‘The long historical evolution toward functional specialisation among the factors of production had reached its destination’ (p. 224).

For Wu, a firm is a coalition of entrepreneurs which agrees a production policy, an organisational structure, and a rule for sharing the residual profits. The entrepreneur’s reward is the result of bargaining within the firm over a share of the surplus. This does not mean that pure entrepreneurs can easily set up a new firm. Start-up firms in risky industries cannot be founded by pure entrepreneurs because of the difficulty of raising funds. Such firms will require innovator-entrepreneurs with private sources of finance, but established firms can generate a sufficient reputation in the market to attract finance, and hence can be managed by pure entrepreneurs.

Further reference to this conception of the firm and the role of the entre­preneur is made in Chapter 8 in the context of corporate governance. It is clearly a controversial view because it implies that market evolution and the power of reputation is enough to overcome the hazards associated with the relationship between entrepreneur and financier. It is also heavily influenced by ‘Austrian’ ideas. The main resources harnessed by each firm are the entrepreneurial talents of the coalition. These will be highly ‘coali­tion specific’ and the surplus generated by the mutual efforts of the entre­preneurs will not necessarily be competed away in the long run. Thus, Wu’s approach has affinities with those who emphasise the importance of the generation of ‘competitive advantages’ within the firm – advantages which cannot be replicated at low cost and derive from the special experience and make-up of a given coalition.

If the firm is a vehicle for the exercise of entrepreneurship we have to get used to the idea that a significant proportion of the income received by those who work in the firm is entrepreneurial profit. This applies not merely to those who have Board appointments, and possess the more obvious claims to profits such as stock options and deferred pension rights, but also to those who work throughout the organisation. The means by which people lay claim to these profits is important. In Chapter 6, for example, we will look at the structure of hierarchies. Promotion can be seen as a way of inducing effort. It might also be part of the mechanism whereby the pure surplus of the organisation is distributed between the entrepreneurs who created it. The important thing is that entrepreneurs have the means of transferring their insights into personal gain. To consider the mechanisms by which this can be accomplished requires us to investigate in much more detail the nature of property rights and the way that different types of organisation reflect different structures of rights.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Introduction of property rights: definition and types

1. INTRODUCTION

In earlier chapters we have discussed at length the phenomenon of ‘exchange’, but until now it has simply been assumed that exchange takes place in goods or services (or in the x and y of our arithmetical example) and that these goods and services are valued because of some physical or technical characteristics. Further progress in piecing together a coherent picture of the firm requires that we refine our concept of what it is that people trade with one another. Occasionally in microeconomics textbooks, the idea is encountered that utility or satisfaction derives not from ‘goods’ in themselves but from their ‘characteristics’. Thus, in Lancaster’s (1966) framework we gain satisfaction not from toothpaste as such but from ‘decay prevention’ and ‘mouth freshening’ qualities which the toothpaste provides. Similarly, in Becker’s (1965) approach, households are ultimately concerned with ‘commodities’ which may be produced by using inputs of various market goods. Thus, a visit to friends may be the desired end (the ‘commodity’) which requires us to use the ‘goods’ car service, petrol, shoe leather and so forth, along with a certain amount of time, if we are to achieve it.

These ideas suggest an even more general proposition. It is not goods in themselves which give satisfaction. It is what people are entitled to do with these goods which really counts. Of course, in the simple case involving an exchange of apples and nuts, so often explored in the economics textbook, the question of the ways in which we could use these physical entities to yield utility barely arises. Even here, however, we might observe that whereas the purchase of an apple entitles us to eat it, or cook it, we are not (say) entitled to propel it through our neighbour’s window, or to drop the core carelessly on the public highway, or to ferment more than certain limited amounts of cider, and so forth. Thus, when people exchange apples and nuts, the physical goods change hands, but that physical transaction is the visible manifestation of something more fundamental. The trade is more correctly seen as an exchange of property rights in the apples and the nuts.

In the case of more complex commodities, the idea is more obvious. The market in the stock of housing, for example, involves the exchange of rights in the stock, and these property rights can be subdivided in such a way that several different people may have different rights in the same physical asset. Consider the legal ‘owner’ of a house. Such a person has the right to occupy the premises (‘usus’); may alternatively let the house to someone else and charge that person (the tenant) a rent (‘usus fructus’); may, within limits, allow the house to deteriorate; or may improve it and change it in a beneficial way (‘abusus’). In all these things, the owner is not entirely uncon­strained. Ownership does not imply being able to do anything we like.

  1. If the owner occupies the premises, there may be limitations on its use. He or she may be forbidden from keeping a caravan in the front garden or chickens in the back, or from painting the windows red. These activ­ities may be forbidden because the original builder or developer of the house, wishing to create a favourable environment and thus to sell the houses for the highest possible price, judged that people will be pre­pared to pay more for houses free from the possible disamenities of neighbouring painters in tasteless red or lovers of noisy animals. By maintaining a right to prevent such activities, the developer is effectively instituting private ‘zoning’ arrangements which may be a less costly solution to these environmental problems than relying on negotiations between the occupants of neighbouring houses after they have all moved in. Whatever may be the economic rationale of the pri­vately established ‘chicken-free zone’, for the moment the important point is that the bundle of rights purchased by the ‘buyer’ of a house is not all encompassing.
  2. If the owner lets the property, he or she thereby transfers the rights of use to the tenant. The tenant now has a bundle of rights in the use of the house. The landlord can no longer enter the house when he or she likes, may possibly be forbidden from charging more than a specified rent (if there is rent control), and may have to give a certain length of notice to the tenant. The tenant on the other hand has the right to live in the house and use the assets for a specified period of time. Under rent control the tenant may have ‘security of tenure’ so that he or she has the right to use the stock for as long as desired providing he or she pays the rent. This right of use might even be inherited by descendants of the tenant.
  3. The right of an owner (or a tenant) to change the asset may be severely limited. As was mentioned in Chapter 2, building an extension to a house will normally require gaining the consent of the local authority planning department who thereby have considerable influence on the amount and type of development.

All these factors influence what a person can do with housing resources and hence the benefit that is derivable from them. The value to an individual of any resource thus depends on the property rights associated with it. As Demsetz (1967) puts it: ‘Property rights are an instrument of society and derive their significance from the fact that they help a man form those expectations which he can reasonably hold in his dealings with others . . . An owner of property rights possesses the consent of fellow men to allow him to act in particular ways’ (p. 31).

2. TYPES OF PROPERTY RIGHTS

2.1. Private Rights

When it is said that a person has private rights in any resource, it means that the particular person concerned and no one else has the authority to decide how the resource should be used. As we have seen, this does not imply that the person is unconstrained in his or her choice. The choice must be from a ‘non-prohibited class of uses’ (Alchian, 1977b, p. 130), but the individual person with private property rights can prevent other people from using the resource in ways of which he or she does not approve. It is important to understand that this definition does not imply that all the property rights associated with a given resource are in the hands of a single person. Rights to use a resource may be partitioned between two or more individuals, as in the case of landlord and tenant, but the rights held by the landlord and the rights held by the tenant in the housing stock are private rights. The landlord can prevent the tenant, or anyone else, physically changing the housing stock or subletting it to another person (unless, of course, the tenant has purchased the latter right from the landlord). The tenant can prevent the landlord from using the stock for his or her own private pur­poses. Thus, the fact that different people have rights in the same physical asset does not necessarily imply that these rights are not private. So long as each person holds different rights, and the exercise of one person’s rights in no way impinges upon the exercise of the other person’s rights, both people have private rights in the resource.

2.2. Communal Rights

There are instances in which a person’s right to use a resource in a certain way is held in common with another person or group of people. My right to walk across common land is the same right as that held by everyone else with access to that land. I may use the resource for the purposes of walking, or gathering firewood, or grazing animals, or whatever, but so also can other people. Other important examples of communal rights might include the right to use a watercourse for the disposal of waste products, or the right to fish on a particular stretch of water or at sea, or the right to allow smoke or other waste gases to escape into the air. In each case, a resource (a river, a lake, the sea or the air) may be used for the same purpose by many individuals. The analytical consequences of communal property will be dis­cussed in a later section, but it will be obvious enough that the problems of ‘congestion’, ‘over-fishing’, and water and air ‘pollution’ are bound up with this question of property rights. Communal property rights, however, do not necessarily imply ‘overuse’ of resources if the group of people who hold these rights in common is restricted to an ‘appropriate’ size.2 Thus, a landowner with private rights to the fishing on a particular stretch of river may decide to restrict the use of the river to a selected group of other people. These people can buy a communal right to fish from the landlord, and any landlord wishing to maximise his or her income from selling these communal rights (licences) will wish to restrict their number.

2.3. Collective Rights

With communal rights, each individual makes his or her decision as to when or how to exercise it. If I am going fishing, I do not have to consult the other people who may also have this right. In the case of a collective right or shared right, the decision about the use of a resource is taken as a group. For example, a group of individuals may form a consortium which ‘owns’ a racehorse. This does not imply that individual members of the con­sortium can enter the horse in whatever race they please, or that they can decide independently on how the horse should be treated. Rather, it implies that some collective decision has to be taken about the training and sport­ing commitments of the horse. This will usually (though not necessarily) imply using some voting process to choose a particular person who will make the detailed decisions which most of the members of the consortium may be ill equipped to make. Once a person has been appointed to this posi­tion, the necessary private property rights which will enable him or her to execute decisions and prevent other non-qualified people from making decisions will inhere in that person. A collective right to determine the use of a resource and to share in the results is quite different therefore from a private or indeed communal right. The manager of the resource will exer­cise the private rights which go with executive decision-making.3 These rights will remain for so long as the consortium believes they are being exer­cised sufficiently effectively on their behalf and it is therefore not worth the time and trouble involved in changing the manager.

2.4. Exchangeable Rights

All trade concerns the exchange of property rights, but not all property rights are tradable. Consider, for example, the tenant in a rent-controlled apartment. Such a tenant has the rights of use which we discussed earlier. These rights have no market value, however. The tenant cannot sell the rights to live in a particular house at a controlled rent to anyone else. This lack of tradability can have important consequences, as the eclipse of the private rented sector in the UK helps to testify. Suppose, for example, the value of a house available for owner-occupation on the market was £50,000. Now imagine that the same house has a tenant paying a below-market rent set by a controlling agency. Clearly, the value of the house with a sitting tenant will be less than £50,000 (perhaps £30,000) depending upon the level of the controlled rent, expectations concerning the future of rent control, or the likelihood that the tenant will move. The result is that the market value of the tenant’s rights (zero) plus those of the landlord (£30,000) falls short of the value of the property unencumbered by the tenant (£50,000). It follows that both landlord and tenant will have a mutual interest in changing the allocation of property rights in the house. By creating a free­hold, there is a potential £20,000 of capital gain to be shared between them. Thus the tenant might offer the landlord (say) £40,000 for his or her rights in the house. This is £10,000 more than their value on the market, but by combining the landlord’s and tenant’s rights in this way, the tenant creates the freehold which, as we saw, was worth £50,000. Both landlord and tenant thus each make a gain of £10,000.

This example is an illustration of an important principle which will be encountered again in differing disguises. The nature of property rights in resources is expected to change when someone perceives that existing rights holders could all be better off by agreeing to such a change. Sometimes this will involve combining hitherto separately held rights in a single holder (as in the landlord-tenant example under rent control). Sometimes, though, it might involve disentangling different rights, at present held by a single person, and then allocating them to different people. Financial markets provide examples of this process. A government bond which promises to pay £5 per year until repayment of the principal in the year 2010 can be held by a single person, but the right to £5 per year until 2010, and the right to the prin­cipal (say £100) in 2010, are quite distinct and could be sold separately on the market. If the market values of the two separate rights sum to more than the market value of the combined rights (the bond) it will pay some financial intermediary to buy bonds and split them down into their component parts.4

Just as private rights may be exchangeable or non-exchangeable, the same applies to collective and communal rights. Consider first the case of communal rights. The purchase of a licence to fish in a particular area results, as we saw, in a communal right (unless of course there is only a single licence). This right may or may not be tradable. If I break my arm after buying the licence, I may be able to sell it to someone else, in which case possession of some document is presumably sufficient to procure admission. If, on the other hand, the licence applies to a single named indi­vidual, it is worthless to anyone else, and will have no exchangeable value. Membership of a club which allows people access to some communal prop­erty constitutes another example. Usually such communal rights are not marketable for the simple reason that when communal access to some resource is involved, the other members of the club will want a say in decid­ing the eligibility of new members.5 Willingness to pay the highest entry fee may not be the deciding criterion. Assessment of character and the proba­bility that the new member will take due care of the communal property may be equally important. On the other hand, the incentive to take care of communal property would appear to be stronger when rights of access are exchangeable than when they are not, since failure to do so will be reflected in a falling market value of club membership as facilities deteriorate.

Company shares represent the classic case of exchangeable collective rights. Members of the consortium that owned the racehorse in our earlier example would normally be able to sell their ‘share’ in the racehorse to another person. We will be investigating the property rights structure of different types of company in more detail shortly. Collective rights in assets taken into ‘public ownership’, such as nationalised undertakings and departments of state, are clearly not exchangeable. This lack of transfer­ability is the crucial distinction between collective rights in the assets of the state and collective rights in the private sector. As Alchian (1965) expresses it: ‘The differences between public and private ownership arise from the inability of a public owner to sell his share of public ownership’ (p. 138).

The right of exchange may be exercised privately or collectively. In the case of the shares of a joint-stock company, the shareholder can trade his or her holdings without gaining the approval of other people. An individ­ual decision to buy and sell shares can be made at any time. It is for this reason that shares in a joint-stock company are usually regarded as private assets. In fact they are privately exchangeable titles to collective rights. The share is a bundle of rights, one of which (the right to exchange) is a private right. It is slightly paradoxical to note that in ‘private’ companies the right to exchange shares is more circumscribed and cannot be exercised purely privately. In the case of public ownership mentioned by Alchian, we might insist that a public owner can sell his or her share of ownership but only through the exercise of a collective right of exchange. Presumably this is what happened in the privatisation programmes pursued in many countries during the 1980s. Taxpayers collectively ‘decided’ through their representa­tive institutions of government to sell their collective rights in the assets of the nationalised industries.

2.5. Alienable and Inalienable Rights

Rights which cannot be reassigned to someone else are sometimes called ‘inalienable rights’. Because they cannot be reassigned, the entrepreneurial function of intermediation which we discussed in Chapter 3 has no scope to operate in a world of inalienable rights. Some rights must be alienable if there is to be any coordination problem to solve. Exchangeable rights are clearly alienable, but the converse does not hold. A manager’s rights to make decisions about the allocation of shared resources are not exchange­able but they are certainly alienable. A collective decision by shareholders can dismiss the manager.

2.6. Exclusion

If rights are to be exchangeable on the market they must, of course, be denied to people who have not acquired them through gift or exchange. If people cannot effectively be excluded from fishing a stretch of river because the costs of policing the river bank are very high, the market price of a licence will be zero. No one will voluntarily pay for a right they can without penalty acquire for nothing.6 Thus the ability to exclude others from using a resource is a necessary condition of exchangeability and hence of markets in property rights. If exclusion cannot be effected, the resource is perforce ‘communal’; we all have a right to grab what we can get. The converse does not hold: communal property may or may not permit exclusion, as was seen in the section on communal rights.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

The development of property rights

Thus far, our attention has primarily been confined to describing property rights and presenting a simple taxonomy. Several questions now arise. Can we explain the development of different types of property rights? How will the nature of a person’s rights influence behaviour? Do property rights matter? One possible response to these questions is that of Demsetz (1967, 1979). Demsetz concentrates on the issue of economic efficiency. We have already seen how the process of exchange gives rise to efficiency gains. If, as a result of resource reallocation, everyone is made better off (as in the example of Chapter 1) the new allocation of resources is said to be ‘Pareto superior’ to the old allocation. It was from these ‘efficiency gains’ that the entrepreneur was seen to draw ‘pure entrepreneurial profit’ in Chapter 3. In Chapter 2, on the other hand, the obstacles to the realisation of gains from exchange, ‘transactions costs’, were considered. Demsetz makes the point that transactions costs are not independent of the types of property rights in which trade is taking place. As a result, some change in the structure of property rights in a resource may be required before potential efficiency gains can be appropriated.

Consider once more the case of the four islanders A, B, C and D. Let us suppose that somewhere on the island is a freshwater lake containing fish. Each person is equally skilled at catching fish and equally conveniently located with respect to access to the lake. The fish are a delicate species living in a finely balanced harmony with predators and prey. One person fishing the lake can take ten fish per day, but two people fishing will find that only 16 fish per day are sustainable. A maximum yield per day of 18 fish can be achieved with three people, and any further fishing effort will actually reduce the sustainable yield of fish obtainable from the lake by depleting the stock of fish by more than can be compensated by the greater fishing effort. Table 4.1 records these illustrative figures for total social product of fish along with average and marginal social product schedules.

From the figures in Table 4.1, we see that each person fishing in the lake imposes on the others an ‘external diseconomy’. Suppose that A is fishing alone and taking home ten fish per day. Person B now comes along and starts fishing. Both A and B each take home eight fish per day. B’s fishing has reduced A’s catch by two fish per day. It would, however, be equally correct to say that A’s presence at the lake reduced B’s catch by two fish per day. The relationship is perfectly reciprocal. Each person imposes external diseconomies on the other. Real external diseconomies in production occur when one person’s actions affect the production possibilities faced by others, and the four individuals around the island lake are clearly interde­pendent in this way. The fishing resource is subject to congestion as reflected in the declining schedule of average social product (ASP) as the number of people fishing increases.

Assume now that all four islanders have a communal right to fish in the lake. No one is excluded from fishing, and hence the right to fish there has no market value. Each person will therefore fish as long as the product he or she takes home exceeds the value of the alternative uses of the time expended (the value of the fish exceeds its opportunity cost). Let this opportunity cost be the equivalent of four fish per day.7 Since the private return to each person is the average social product (ASP), extra people will fish if ASP is greater than four and this implies that all four people on the island will fish in the lake.

Of course, from the point of view of the group of islanders as a whole, this outcome is not ideal. They could all be better off by restricting access to the lake. This is more easily seen by considering the ‘surplus’ which the community as a whole has derived from the resource. The ‘surplus’ is simply the total social product of fish minus the opportunity costs of catching it. For the case of three fishermen, total product is 18 and total social cost is 12, thus resulting in a ‘surplus’ of six. With four people fishing, this surplus has declined to zero. The presence of the lake has conferred no net benefit on the four individuals. Between them, they have managed to sacrifice else­where things equivalent in value to the fish they have caught. If access to the lake were restricted to two people, the social product would be unchanged at 16, while the other two people would be free to take leisure or to produce additional goods elsewhere, goods or leisure which we have assumed are valued as equivalent to eight fish.

Restricting access to the lake implies changing the nature of the property rights which people hold. The lake can no longer be unrestricted common property. We can imagine many different ways in which this might come about.

  1. All four islanders might come to some mutual agreement whereby they renounce their communal rights in exchange for a collective right to a share in the produce of the lake. This ‘fisheries consortium’ will then have an incentive to ensure that only two people actually fish in the lake.
  2. One of the islanders, perhaps the entrepreneur E of Chapter 3, might simply try to buy up the common rights of the others. Providing that the price of these rights is less than the surplus expected from the com­mercial exploitation of the lake, a profit will be achieved on the trans­actions. The bargaining and transactions costs involved in this strategy are likely to be considerable, however, because those who ‘hold out’ against the offers made by person E will in the end find themselves in a very strong bargaining position.8 It is this problem which is often used to justify an element of coercion by the state when bargaining costs threaten the achievement of potential efficiency gains. The power of compulsory purchase, for example, has been defended as a method of reducing transactions costs in situations which would otherwise be subject to the problem of ‘hold-out’.
  3. A third possibility is that one of the islanders announces to the others that from henceforth they cannot fish in the lake without his or her consent. Common rights are confiscated by force and replaced by private rights.

The establishment of property rights may be modelled as a Hawk-Dove game. Suppose, for example, that the social gains available from the more effective use of the lake are valued at two. Two ‘Doves’ would meet and split the benefit between them (as in the first of our three scenarios). A ‘Hawk’ meeting a ‘Dove’ would take all the gains by the threat of force. Two ‘Hawks’ meeting, however, might do each other great damage. Let us suppose the payoff to each in this situation is -2. The payoffs are recorded in Table 4.2.

If all of the islanders are equally equipped to fight and equally likely to win in the event of a fight, the expected payoff to the two possible strategies will depend upon the probability of meeting a ‘Hawk’ or a ‘Dove’. A high prob­ability of meeting a ‘Hawk’ makes a ‘Dove’ strategy more attractive, and vice versa. With the payoffs as recorded, a probability of meeting a ‘Dove’ of 2/3 will result in the same expected payoff to each strategy. Biologists use this type of argument to model the evolution of a stable proportion of aggressive to non-aggressive animals in a certain species.

The outcome would be different, however, if the islanders could be dis­tinguished in some way and if this distinction began to be associated with the probability of their playing a ‘Hawk’ or ‘Dove’ strategy.9 Islander B might, for example, live closer to the lake, or have fished in the lake for longer. If proximity to or use of a resource is recognised by the players as conditioning the likelihood of playing a given strategy in the Hawk-Dove game, conventions of property can evolve. A convention might become established such as ‘play the Hawk strategy if you live closer to the resource than your opponent’ or ‘play the Dove strategy if your opponent has used the resource for longer than you have’ and so forth. Just as in the repeated exchange game of Chapter 1, these property conventions could become self-enforcing. As an islander it would be in my self-interest to recognise the sorts of characteristics likely to determine whether an opponent would fight or not. In other words it pays me to recognise emergent rules of property.

Whatever the mechanism by which changes in property rights are effected, one result is the same. By restricting access to the resource, efficiency gains are achievable. The distribution of these gains will depend on the property assignment between the islanders. They may all accrue to the strongest person, to the person who lives closest to the lake, to the only person with a gun, to the entrepreneur E, or they may be shared between members of the fisheries consortium. Whatever the rights assignment turns out to be, a licence to fish in the lake set at a price equivalent to four fish would reduce the number of people fishing to two, and procure an income of eight fish for the resource owner or owners.

Figure 4.1 illustrates the entire argument geometrically. The average social product of fishing effort is depicted by curve ASP. Since, by assump­tion, all people are equally skilled at fishing, the private return to extra fishing effort will equal the average social return. This implies that people will continue to supply more fishing effort as long as average social product exceeds marginal private cost. Marginal private costs (equal to social costs) are assumed constant at c. Thus, with unrestricted entry into fishing we find a quantity of fishing effort F1 provided. This is inefficient, since the mar­ginal social product of fishing effort (MSP) is well below marginal social cost (MSC) at this level of effort. Efficiency requires a level of fishing effort F2, where MSP = MSC. The vertical distance between ASP and MSP rep­resents the external cost imposed on others by extra fishing effort. Similarly the vertical distance between ASP and MPC represents the private benefit accruing to extra fishing effort. Clearly, if external costs exceed private benefit it will be possible for people to get together to bribe some of their number not to fish, since the bribers will gain from reduced congestion more than they have to pay in compensation to the people being bribed (see especially Coase, 1960).

The gain to restricting fishing effort in this way is given by area bfe. Once private rights to fishing have been established, a licence fee per unit of fishing effort (l) can be introduced, the private marginal costs of fishing will increase to l + c = a and fishing effort will fall to F2. Total revenue from the sale of licences will be area acbd and in these circumstances will precisely equal the efficiency gains achieved (area bfe).10

For Demsetz, therefore, property rights will change when people find that there are substantial efficiency gains (mutual benefits) to be derived from such a change. These benefits must be sufficient to compensate for the transactions costs involved in establishing and in policing the new structure of rights. If people cannot, except at prohibitively high cost, be prevented from fishing in the lake, no changes in the structure of rights will emerge. On the other hand ‘prohibitive’ here means ‘relative to potential benefits’, and the more congested the lake becomes the larger become the efficiency gains from mitigating the problem, and the more likely therefore that the policing costs are worth incurring.

Demsetz cites as an example of this process changes in the rights to use land among the Indians of Labrador. It appears that in the seventeenth century there was no restriction on hunting rights. The development of the fur trade, however, gave rise, according to this theory, to the division of territory among hunting bands. Clearly, without such rights each hunting
party could impose severe external disbenefits on the others. When animals were hunted merely for food and clothing for the Indian popula­tion, the problem of external costs was insignificant, but with commercial development this was no longer the case and more restrictive property rights emerged. Demsetz then poses the question: why did a similar process not occur among the plains Indians of the south west? One answer is that the cost of defining and policing property rights on the plains would have been much higher than in the forests. Animals grazing on the plains wander over vast areas; those in the forests are more restricted in their movements. Thus, simple delineation of a given policeable area is not sufficient to appropriate a right to tend and harvest a particular herd on the plains, and property rights remained common and open. The conse­quence of this structure of property rights for the buffalo herds is, of course, well known.

Although property rights may develop in ways conducive to the more efficient use of economic resources, it should not be assumed that this ten­dency is inevitable. The point about conventions is that there are many pos­sible ones that might evolve and that, once evolved, they are self-enforcing. Whether the conventions most likely to evolve turn out to be the ones that, once established, are the most efficient, is not certain. As we have seen, some writers have tended to assume that a benign process of social development can be expected over time. Others are less sure. Further discussion of this issue must await Chapter 12.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Team production and the classical capitalist firm

Just as the development of property rights in resources subject to conges­tion can be seen as an attempt to achieve efficiency gains, so the develop­ment of institutional structures such as firms can be viewed in the same light. In Chapter 2, the firm as a device to economise on transactions costs was considered in some detail. We did not emphasise at that stage, however, that the contractual relationships found within ‘the firm’ estab­lish a structure of property rights in the use of resources. In a classic paper, Alchian and Demsetz (1972) elaborated on this theme and argued that the structure of property rights observed in the classical capitalist firm was a response to transactional problems, and in particular to the problem of ‘team production’.

The essence of the firm for Alchian and Demsetz is that it permits people to work as a team. Team production occurs when an output is produced by the simultaneous cooperation of several team members. Production is not a sequence of identifiable stages by which a series of intermediate products are gradually transformed into the final output. Rather, the final output is the joint result of the combined efforts of all the inputs working at the same time. It follows that the individual contribution of each member of the team to the final output cannot be isolated and observed. All that can be observed in terms of output is the combined result of the entire team’s efforts.

A further complication is that any one person’s activity may affect the productivity of the other members of the team. In these circumstances there will exist an incentive for people to get together and agree to take account of external effects in their behaviour. In the last section, we showed how people could gain by forming a ‘fisheries consortium’ if their fishing activities imposed external disbenefits on each other. Here, the same argument can be used to show that a collective agreement to modify behaviour may be useful in the presence of external benefits. Person A agrees to work a little harder on the understanding that person B will do likewise. The benefits in terms of higher output of their joint decision to work harder will be sufficient to compensate them both, although any indi­vidual commitment to greater effort in the absence of the other party would not have conferred net private benefits on the person undertaking the extra work.

The situation is analogous to the public goods problem discussed in Chapter 2. No individual person may have an incentive to provide a public good, although a joint decision to produce one may confer benefits on everyone. It will be recalled that a joint decision was difficult to arrange because each person had an incentive to understate his or her true valua­tion of the public good in the hope that other people would pay to provide it. People, in other words, would tend ‘to shirk’ and fail voluntarily to pay their contributions. In the same way, a joint agreement to work harder in order to increase team output will be difficult to implement unless each person’s behaviour can be monitored. Without monitoring, each person will ‘shirk’ and hope to ‘free ride’ on the effort of other people. Note that this problem would not arise if an identifiable output could be assigned costlessly to each person, for then a contract linking reward to performance would be possible. In the case of ‘team production’, however, there is a single output produced by the simultaneous cooperation of all members of the team, and the individual contribution of each member cannot be sep­arately identified.

Problems of ‘moral hazard’ will therefore have to be overcome if the potential advantages of team production are to be achieved. The ‘solution’ suggested by Alchian and Demsetz is that the team requires a ‘monitor’ to observe the individual members and to check that their effort is satisfactory.

Clearly, this solution requires that effort is observable and this will obvi­ously not always be the case. Where the team is concerned with the coordi­nation of fairly simple ‘manual’ operations, the observation of the inputs to ensure that they perform the tasks they contract to perform may not be very costly. In other cases, observation of behaviour may be a very imper­fect guide to the effective input of the team member involved. Further, as we saw in Chapter 2, a monitor may not have the information to judge whether the actions taken by a particular person are or are not in the inter­ests of the team as a whole. The problem is equivalent to person A’s difficulty of contracting with his or her architect.

Assuming for the time being that a monitor is capable of observing the effort of team members, the problem remains of providing the monitor with some incentive to bother. If the monitor is simply another member of the team whose job is to check that all other team members are fulfilling their contractual commitments, the monitor would have as much incentive to shirk as anyone else. It is for this reason, argue Alchian and Demsetz, that the monitor becomes a residual claimant. Each team member receives a contractual reward in the form of a wage, and the monitor receives what­ever remains after these payments have been made. The more effectively the team operates, the bigger the residual will be, and hence the monitor will have a definite interest in promoting the efficiency of the team. All the benefits from improved coordination will accrue to the monitor instead of being shared amongst the team members.

If the status of residual claimant is to provide the monitor with an incen­tive rather than merely an interest, he or she must be able to discipline team members. It would be pointless monitoring the behaviour of team members if they could then ignore the monitor’s criticisms. Thus, the monitor becomes the common party to all contracts with the power to alter these contractual arrangements and to add and subtract from the team (to hire and fire). Note the difference here from Coase’s view of the firm. Coase emphasised the costs of arranging detailed multilateral contracts as an explanation of the firm. Alchian and Demsetz emphasise the necessity of the monitor having control of contractual arrangements in the context of team production if shirking is to be reduced.

From the perspective of property rights theory, therefore, the traditional single proprietorship can be seen as a form of enterprise which concen­trates property rights in the hands of a single person. The ‘private’ nature of these property rights gives the possessor the maximum incentive to con­sider the consequences of his or her actions for the market value of the rights. The more effectively the single contractual agent or proprietor mon­itors and organises the team, the greater is the residual claim and the more valuable will be his or her property rights on the market. Exchangeable private rights to determine the use of team resources, monitor operations and claim the residual, represent a response to the moral hazard problem posed by shirking in the context of team production.

Although Alchian and Demsetz in their original (1972) paper saw team production as the primary source of the moral hazard problem, it is evident from the arguments reviewed in Chapter 2 that, even in the absence of team production as we have conceived it, asymmetric information can lead to problems of moral hazard. Whenever it is difficult to assess the quality of an intermediate product, for example, the supplier may have an incentive to shirk and there will be an advantage in appointing a specialist monitor. In this way, a purely market transaction becomes a transaction conducted within a firm. We will consider this process in more detail in a later chapter on the subject of vertical integration. For the present, it is merely necessary to note that the single proprietorship can be considered as a response to the ‘shirking’ problem and that this does not require us to assume conditions of team production rather, as Williamson (1975, pp. 49-50) emphasises, this may be an appropriate view wherever information difficulties lead to oppor­tunistic behaviour.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Alternative structures of property rights

In the last section we focused attention on the structure of property rights characteristic of the single proprietorship. Common observation tells us, however, that, numerous and economically important though such arrangements are (for example, Storey, 1982; Bolton, 1971), the modern economy has developed institutions of far greater complexity. An explana­tion of these more complex institutions must ultimately reflect the idea that concentrating property rights in a single holder is not necessarily the most efficient structure. Sometimes the sharing of rights between people or the apportioning of different private rights between people may be efficient. Thus the full package of property rights held by a proprietor may instead be shared between two or more people in a ‘partnership’, or alternatively the right to claim the residual may be shared between one group of people and the right to monitor the inputs may be held by another as in a ‘joint stock’ company.

At first sight such a statement appears to contradict flatly all that was argued in the previous section concerning the necessity of overcoming the moral hazard problem posed by shirking and the resulting concentration of property rights. It is evident, however, that the complete avoidance of all moral hazard problems is neither feasible nor efficient. If this were not so

it would be a simple matter to circumvent moral hazard by refraining from all contractual relations with other people and forgoing the benefits of divi­sion of labour and exchange. The maximum concern for fire prevention and theft prevention can be achieved no doubt by abolishing insurance markets, but few people would advocate such a move or claim that economic efficiency would be enhanced. It may be worth while tolerating reduced incentives if the benefits of a more efficient distribution of risk taking are sufficiently great. Conversely (and this is something we shall discuss in more detail in Chapter 5 on principal and agent) if a perfectly efficient distribu­tion of risk taking involves severe problems of moral hazard it may be worth while sacrificing risk-sharing benefits in the interests of providing incentives. In other words we might expect observed contractual relations to reflect the available trade-off between risk-sharing benefits and effort incentives, a trade-off which will be affected in any given case by the costs of monitoring.

1. The Single Proprietor

Consider once more the single proprietor. A team endeavour requires a monitor if it is to operate effectively. The problem of providing incentives to the monitor is then encountered. This can be viewed as a classic ‘agency’ problem. If members of the team cannot tell whether the monitor is per­forming the promised services, they have to devise some incentive structure based not on unobservable behaviour but on observable outcomes. The one thing that, by assumption, is observable by everyone is the final output of the entire team. Instead of a system in which this output is shared between all members of the team therefore, a form of organisation evolves in which team members receive a fixed wage (irrespective of overall team perfor­mance) and the monitor receives the residual. In Chapter 5 we shall inves­tigate more carefully the circumstances in which we expect an agent to promise a principal a fixed sum in this way and thus to relieve the latter of all risk. However, it is not offensive to the intuition to learn that if the agent is risk neutral and the principal is risk averse the efficient contract between them will involve the agent bearing the entire risk.

It is very important to understand the nature of the ‘thought experiment’ conducted in the above paragraph. The single proprietor is clearly not an agent in a legal sense. As we saw in an earlier section, the proprietor must be the employer, not merely the agent of the team, if he or she is to have the authority to influence team behaviour. However, in piecing together a ratio­nale of the proprietorship, it is defensible and indeed enlightening as a first approximation to regard the structure of incentives embodied in this form of organisation as a solution to an agency problem. In a similar spirit, we will later on consider how far and in what circumstances an employee might be considered an agent of the manager or the manager an agent of the shareholder.

Let us suppose now that the moral hazard problem is so severe that in the absence of a monitor-employer the team could not survive in any shape or form. On the other hand, assume that the returns to monitoring activi­ties are substantial so that a proprietorship is viable. If the proprietor is risk neutral and the employees are risk averse, the traditional structure with the proprietor receiving the residual and the employees a given wage will be efficient. Notice how akin to Knight’s is this conception with the ‘confident and venturesome’ providing insurance for the ‘doubtful and timid’. Alchian and Demsetz, however, explicitly reject the Knightian risk-sharing approach to the firm and prefer to concentrate instead on the advantages of team production. Yet any final organisational form, with its structure of property rights held by the members of the organisation, will presumably reflect all the forces which we have so far discussed:

  1. The potential advantages of further specialisation (division of labour) or team production.
  2. The extent to which the exchange relationships involved in point 1 above give rise to problems of moral hazard.
  3. The returns to monitoring.
  4. The trade-off between risk-sharing benefits and incentives.

The single proprietorship is a ‘solution’ to the problem of organisational form under rather special circumstances:

  1. The potential advantages of specialisation or of team operations are limited to groups sufficiently small to be efficiently monitored by a single person.
  2. Moral hazard problems are severe, but . . .
  3. The returns to monitoring are such that at least over a certain range the extra output of the monitored team is more than sufficient to compen­sate the monitor for his or her effort.
  4. Either (a) the returns to monitoring effort are certain, or (b) monitor­ing effort favourably affects the probability distribution of the residual by increasing expected output net of monitoring and contractual costs, and the monitor is risk neutral.

By setting out the conditions most favourable to the establishment of a single proprietorship in this way we begin to perceive the circumstances in which alternative organisational forms might be observed.

2. The Partnership

2.1. Monitoring costs

We start by assuming that returns to monitoring effort are certain. In Figure 4.2, the curve labelled MCA = MCB represents the marginal costs to monitors A and B of different levels of monitoring effort. It measures the extra monetary payment required to induce them to exert one more unit of effort. Curve MB1 indicates the marginal returns to extra effort. We suppose that after a certain point (E1) the marginal cost of monitoring effort rises and eventually becomes vertical as the limits of human endurance are reached. The returns to extra monitoring of the team decline throughout. A single proprietor (say person A) facing curve MB1 would put in effort level E1 where the extra returns from marginal effort are just equal to the compensation required. The final reward of the monitor will depend upon the level of effort required for the team to break even. Suppose for example, monitoring effort E0 is required if contractual payments to other team members are to be met from the value of output. All further moni­toring effort will produce a residual which can be claimed by the monitor. Since we have assumed that returns to monitoring effort are certain, we would expect this residual will just be sufficient to compensate the monitor for the costs he or she incurs (including wages forgone as a team member).12 Thus, the additional benefit to monitoring effort above E0 (the value of the residual, areas A + C) will be just equal to the costs incurred in being a monitor (areas B+ C). Hence, under conditions of a certain return to mon­itoring effort, area A = area B.

Now suppose that the returns to monitoring effort are given by MB2 in Figure 4.3. Under these conditions a partnership of two people monitor­ing together will exert effort level 2E1 =E2. The marginal cost of effort at this point is the same for each partner, but we might ask why a single pro­prietor should not monitor the team and exert effort level E1‘? The answer is that competition from the partnership form of organisation will under­mine the single proprietorship. Once more, assume that effort level E0 is required for the team to break even. It follows, as before, that where the returns to monitoring effort are certain, both partners will just be com­pensated for the costs of monitoring the team. Thus area A'(abc) = area B'(of aE0). But if area A’ equals area B’, a single proprietor exerting effort E1‘ would not be able to earn enough to compensate for the work involved. Clearly, the residual of the single proprietor will be bdE1‘E0 which will fall short of total monitoring costs (the total area under MCA up to Ej’) since area aedb<B’. The essential point is extremely obvious.

If a partnership can monitor at the same marginal cost as a proprietor­ship or lower up to E1‘ and can further afford to monitor to a higher level of intensity (E2), the partnership form of enterprise will take over from the proprietorship and the efficiency gains represented by area edc will be the prize.

The assumption that the monitoring costs faced by each individual are uninfluenced by the forms of organisation is, of course, crucial to this piece of analysis. A partnership involves an agreement between two or more people to perform certain monitoring services in exchange for a specified share in the residual. Even when the returns to monitoring effort are assumed to be certain, therefore, contractual difficulties are likely to be encountered. If the monitoring effort of each partner is perfectly and cost­lessly observable by all the others, partnership arrangements will be a pre­dictable response to the increasing productivity of team effort and hence will permit larger team sizes than would otherwise be possible. Where, however, the behaviour of each monitor is costly to observe, Alchian and Demsetz’s moral hazard problem reasserts itself. The effort of one monitor (partner) confers benefits on the others and the result will be an incentive to shirk. We therefore deduce that partnerships are more likely to evolve where the process of monitoring is itself routine and susceptible to a degree of accountability than where the effort of each partner is almost impossi­ble to observe or deduce. Where the returns to monitoring are certain, two partners should each be able to police the activity of the other since each will know their own effort and can deduce the effort of the other from the final team output.13 As the number of partners increases, however, the incentive to shirk will rise, since assigning individual responsibility for poor team performance may become impossible, and the effort of any individ­ual partner will have a smaller and smaller effect on the value of his or her share.

Our discussion of the partnership, thus far, leads to the conclusion that sharing the right to the residual may hold out the possibility of potential efficiency gains by reducing the marginal cost of monitoring large teams. Against these potential gains we must set the extra problems of moral hazard which may arise when rights are shared in this way. If monitors begin to shirk, the efficiency gain (area edc) in Figure 4.3 may be dissipated and the single proprietorship will continue as the most effective organisa­tional form. At any rate, partnerships are likely to be fairly small and, given the trust which must exist between partners if they are to avoid the losses involved in opportunistic behaviour, it is expected that the use of close family connections and those amenable to peer-group pressure will be fre­quent. A more extended discussion of profit sharing as an institutional form appears in Chapter 10.

2.2. Risk sharing

The assumption that returns to monitoring are certain is a convenient simplification when discussing the trade-off between monitoring costs and the hazards encountered when the residual is shared. As was noted earlier, however, the residual received by the monitor is unlikely to be determinis­tically related to monitoring effort. By accepting a residual reward, the monitor is exposed to risk. The fact that the residual may vary for reasons unrelated to monitoring effort has important implications.

  1. In the first place the suppliers of ‘contractual’ resources to the team effort will want assurances that, if the residual turns out to be negative, they will still receive the promised payment for services rendered. This implies that a monitor will require some personal wealth to act as ‘col­lateral security’. We discussed at some length in Chapter 3 the possi­bility that an entrepreneur with no wealth might persuade others to provide finance, but, accepting that this may occur, there are clear limits to the resource inputs which can be acquired in this way. Risk implies that the size of a single proprietorship will be limited by per­sonal wealth and that partnerships will be necessary if firms are to grow beyond these limits.

It is worth emphasising that the above argument relies on the premise that the risk faced by the monitor is ‘uninsurable’. Even if the residual could be represented by a probability distribution conditional upon effort, and was thus ‘risky’ in Knight’s strict sense, insurance markets would succumb to the moral hazard problem if the monitor’s effort were not observable. If the monitor were certain of achieving a given residual through an insurance contract, the incentive to exert effort would be entirely lost, and with it the whole point of giving the monitor the residual claim.

  1. Where the monitor is risk neutral, the concentration of risk is efficient as we have seen. Where, however, the monitor is risk averse along with other members of the team it makes no more sense to concentrate all the risk on such a person than to insist that someone confronting rapidly rising marginal costs of effort should do all the work. By taking a partner, the risks of the enterprise are shared, and where both part­ners are risk averse, total risk-bearing costs will decline. Thus, just as we showed in subsection 5.2.1 that partnerships might permit lower monitoring costs, they might also permit lower risk-bearing costs. It is then the sum total of these two possible efficiency gains which must be set against the moral hazard problems arising from sharing a property right in the residual.

The history of the partnership form of enterprise illustrates the operation of these conflicting forces. Each partner, in addition to sharing in the resid­ual, has rights to use and manage the resources of the team. The decisions of each can therefore bind the others, and the partners are responsible for all debts, whether or not as individuals they were personally involved in incurring them. Indeed, the English law of partnership developed the rule that each partner was liable ‘to his last shilling and acre’. In the face of this stringent legal background of unlimited liability, it is not surprising, as was noted earlier, that the property rights of each partner are not freely trad­able. If a partner withdraws or dies, the partnership is broken and has to be reconstituted. Thus, it is very complicated and difficult for a partner to extricate his or her share of the resources from the business.

Because in the case of a partnership there is no single contractual agent but several agents capable of contracting on behalf of each other, the trans­actional difficulties involved in this type of enterprise are substantial. During the 1830s in the UK, when dissatisfaction with the law of partner­ship was growing, one of the major issues concerned the difficulty faced by a third party in suing a partnership or vice versa, and the difficulty involved in one partner suing another. Grievances between partners were particu­larly troublesome given the difficulties of acquiring information, and cases were reported ‘which were upwards of thirty years in the Court of Chancery’.14 The fact that the 1837 report on partnerships was particularly concerned ‘with regard to the difficulties which exist in suing and being sued where partners are numerous’ is indicative of the transactional prob­lems encountered by large partnerships. These problems rapidly cancelled any risk-sharing or monitoring benefits available, and effectively placed a limit on the size to which a partnership could grow.15

3. The Joint-stock Company

Large-scale enterprises involving the cooperation of thousands and even hundreds of thousands of individuals would clearly not have evolved had the property rights structures characteristic of the single proprietorship and the partnership been the only possibilities available. The potential gains available from the monitoring of large teams – the ‘visible hand’ as Chandler (1977) has termed it – required a new structure of rights to emerge, a structure which did not expose the managers of large-scale enter­prise to a degree of risk which they were not prepared to shoulder, and which permitted capital to be supplied by many people who would play no part in day-to-day business decisions. This particular combination of char­acteristics was impossible to achieve under the strict law of partnership. Capital could be borrowed, no doubt, at fixed interest from many people, but only at the cost of tolerating a very high ‘gearing’ or ‘leverage’ in the financial structure. Such high ratios of debt to proprietor’s or partners’ wealth would increase the risk of insolvency; a spectre made even more appalling by the provisions of unlimited liability which effectively meant personal ruin in the event of business failure on such a large scale. As we have seen, the alternative of growing through the addition of new partners was rendered unattractive because of the transactional difficulties involved and the enormous trust required in the integrity of other members of the partnership.

The joint-stock company developed as a response to these difficulties. For our purposes, there are three characteristics of great economic impor­tance.

  1. A joint-stock company has a legal existence quite distinct from the people who comprise the company at any given point in time. People may come and go but, unlike the partnership, the company continues in existence. Further, as a separate legal entity, a joint-stock company can sue and be sued. This greatly simplifies contractual relations with third parties and helps to overcome some of the difficulties alluded to in the previous section.
  2. The shares of public companies are freely exchangeable. A market can therefore develop in these shared rights (the stock exchange) and it is a relatively costless exercise to buy or sell an interest in any particular company. Ekelund and Tollison (1980) argue that this ease of transfer­ability was important in the early history of the development of the joint-stock form of enterprise. Lack of transferability would inhibit the most talented and qualified people gaining control of productive resources, which would instead remain in the same hands or in the same family for many years. Over the long run, the flexibility offered by joint- stock enterprises in reassigning property rights to more energetic people would give them an advantage over alternative institutional forms.16
  3. The third important characteristic of joint-stock enterprises is that the liability of shareholders is limited.17 With unlimited liability, people will naturally be chary of business associations involving people they do not know personally. With limited liability the prospect of sub­scribing relatively small amounts to an enterprise will be more tolera­ble, in the secure knowledge that the rest of a person’s fortune is not inevitably at hazard in the same enterprise. Perhaps a more important implication of limited liability than the effect on the willingness of people to supply finance (as we have seen they could always lend at fixed interest to other types of enterprise) is the willingness of man­agers to raise finance. For the directors of a joint-stock company are themselves liable only to the extent of the shares they hold in the company and indeed there is no legal requirement that they should hold any. With risk spread widely in this way, rising costs of risk­bearing do not constrain the size of operations as severely as they do in a partnership or proprietorship.

It is sometimes said that the coming of limited liability and the joint-stock enterprise lowered the ‘cost’ of finance. This is a somewhat misleading way of thinking, however. When people supply finance, whether by loan or by buying shares, they are aware of the institutional arrangements prevailing and are unlikely to ask for or expect a lower return when dealing with a limited liability company than with other forms of enterprise. They will ‘pierce the veil of limited liability’ and may adjust upwards their required return to allow for any perceived adverse effect on managerial incentives.18 The advantage of limited liability is that even after these upward adjust­ments have been made, the possibilities opened up by large-scale operations may be more than adequate to compensate. Much depends here, however, on terminology. A single proprietor with unlimited liability would be expected to undertake fewer projects than would be the case after turning the enterprise into a limited company. Even if the available projects were identical in the two cases, the additional risks faced by a proprietor will induce him or her to apply a higher discount rate, and fewer of the projects will yield expected returns which exceed the ‘cost of capital’. Thus, ‘the cost of capital’ to the enterprise, interpreted in this way, is very likely to be lower in the limited company, but this is just another way of saying that risk­bearing costs are lower to the decision-makers.

Although Ekelund and Tollison emphasise the transferability of shares as a crucial force in the origins of the corporation in the sixteenth and sev­enteenth centuries, by the mid-nineteenth century the risk-sharing charac­teristics of the corporate form with limited liability appear to be a more decisive consideration. Hannah (1983a, p. 23) reports that, in the UK, eighty per cent of joint-stock companies were private not public as late as 1914. Private companies are those which specifically restrict the right to transfer their shares while retaining the other characteristics of the joint- stock form, including limited liability. It is instructive to consider the pos­sible reason for this popularity of the private company in the UK into the twentieth century.

In terms of property rights, the fundamental characteristic of the cor­porate form is that the ‘right to claim the residual’ is separated from the ‘right to monitor the inputs’. This ‘separation of ownership from control’ has certain transactional advantages, as we have seen, and it permits the development of a class of specialist managers, but it also confronts the problem of managerial incentives which was so central to our earlier dis­cussion of team production. The suspicion that joint-stock enterprises would result in inefficient if not corrupt management has a long history. Adam Smith, for example, wrote that ‘negligence and profusion, therefore, must always prevail, more or less, in the management of the affairs of a joint stock company’.19 Such ‘negligence and profusion’ will not prevent the emergence of the joint-stock form if potential efficiency gains exist which are sufficient to compensate, but it is clear that the problem of man­agerial incentives is central to this form of enterprise. The use of managers from a restricted family circle, each with a considerable shareholding and with limited ability to dispose of their holding, as in a private company, can obviously be viewed as a response to the incentives problem. Even suc­cessful public companies at the turn of the twentieth century in the UK used management from the families which founded and built the firms during the nineteenth century; firms such as J. and P. Coats, Imperial Tobacco, and Watney Combe Reid. As Hannah (1983, p. 24) remarks, ‘while this solved a fundamental problem of the corporate economy – that of maintaining managerial efficiency while divorcing ownership from control – it did so more by avoiding the issue than by devising new tech­niques of incentive and control’. In the United States, on the other hand, the development of the corporate form occurred more rapidly than in the UK, and innovation in corporate structure was more advanced. Indeed Chandler (1977, 1990) attributes backwardness in the UK up to the 1940s to the influence of family management and the failure to develop sufficiently quickly a class of professional managers.20 How far the former was the cause of, rather than a rational response to, the latter is, however, a moot point.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Property rights and managerial theories of the firm

By the early 1960s, the large professionally managed corporation was such a familiar part of the institutional landscape that it began to influence the thinking of economists about the firm. A series of ‘managerial’ models of the firm appeared during these years. They all had the same essential structure. Surpluses could be generated within the large firm. These were interpreted as resulting from the exploitation of a degree of monopoly power rather than of entrepreneurial talents. The firms were under the ‘control’ of the managers but these managers faced constraints on their behaviour. Thus the managerial tradition in the theory of the firm was capable of generating an enormous variety of models depending upon the objectives assumed of the managers and the way the constraints on their behaviour were handled.

The most celebrated managerial models are those of Baumol (1959), Marris (1964) and Williamson (1964). They are distinguished primarily by the assumed objectives of the managers. Baumol suggested that managers maximise revenue from sales, Marris that they maximise growth, and Williamson that they maximise a utility function including ‘staff5 or ‘emol­uments’. In each case, the existence of monitoring from outside and limits to managerial discretion were recognised. Baumol included a minimum profit constraint in his model, and Marris similarly incorporated a valua­tion ratio constraint to reflect pressure from shareholders. The valuation ratio is the market value of outstanding equity shares divided by the book value of the assets of a firm. Too low a valuation ratio will involve a risk of takeover ‘unacceptable’ to the management (Marris, 1963, p.205).

In general, the conclusion of these models was that managerial firms would produce higher output, employ more staff, or grow faster than stockholder-controlled ones. In other words, the costs to the stockholders of asserting their influence gave a margin of discretion to managers which they were assumed to use in pursuit of whatever objectives the analyst felt could plausibly be imputed to them. Figure 4.4 provides a diagrammatic representation of Williamson’s staff model and Marris’s growth model. In the case of part (a), the recruitment of more staff is assumed first to add to profit but after a point to cause profits to decline. The manager’s utility is maximised at point m to the right of the profit-maximising level of staff. For Marris, the figure (part (b)) is again basically the same, with the hori­zontal axis now measuring the rate of growth and the vertical axis the val­uation ratio. The constraint will not emanate from the origin but is expected to have the same concave shape. If growth is pushed past a certain point the value of shares on the market will fall as diseconomies associated with staff training are encountered (Penrose effects21) and as a greater proportion of earnings are retained in the firm to finance expansion instead of being paid in dividends to shareholders.

Later in the 1960s and 1970s, the development of property rights theory and recognition of the importance of ‘studying the property aspect of behaviour’ led to the extension of managerial models into different insti­tutional contexts. Niskanen (1968) introduced the property rights approach to the study of bureaucracy, Furubotn and Pejovich (1970, 1971 and 1974) analysed the labour-managed firm and the Soviet firm within a property rights framework. In Chapters 10 and 11, some of this literature is reviewed in more detail. At the most general level, however, the man­agerial tradition, while recognising that moral hazard problems were important in particular circumstances, did not directly address the ques­tion of the mechanisms which might be used to overcome them. Implicitly, the whole approach was concerned with the problem of the principal-agent relationship. Contractual and monitoring responses were not investigated in detail in the managerial tradition. Chapters 8 and 9 of this book are concerned with the modern analysis of the corporation from a principal-agent perspective.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Property rights and transactions cost approaches to the firm

1. ‘Ownership’ of the Firm

The firm in transactions cost analysis is a ‘nexus of contracts’ as explored in Chapter 2. These ‘contracts’ represent promises between participants in the firm. The ability and incentive of the participants to fulfil their con­tractual obligations will depend upon their assignments of property rights. Within the firm, property rights in resources are assigned to the various participants. Different assignments of property rights can then be seen as characterising different types of organisation (for example, the proprietor­ship, the partnership and the joint-stock enterprise) as discussed in sections 5 and 6. If transactors successfully pursue efficiency gains, property rights will be assigned in such a way that the gains from trade net of all costs of transacting are maximised. This type of analysis of property rights is thus complementary with transactions cost theory.

An interesting aspect of the transactions cost theory as summarised above, however, is that the concept of ‘ownership’ is not well defined. Assets seem not to have ‘owners’. Rights to use assets and to claim income flows are simply divided up between cooperating groups of people in ways which facilitate mutual gain. The quote from Demsetz (1967) at the end of section 1 simply mentions ‘the owner of property rights’, suggesting that the ‘owner’ is simply a ‘possessor’ or a ‘holder’ of rights. In so far as a more specific idea of ‘ownership’ has so far appeared in the analysis, it has implic­itly been associated with residual claims. References, for example, to the ‘division of ownership from control’ suggest that the residual claimants (shareholders) are the ‘owners’ of the firm but no longer are really in a posi­tion to exercise control over the use of the assets.

Given the complexity of property rights assignments – the fact that many people may be contractually permitted to use an asset for specified pur­poses and may claim portions of the resulting profit – is it possible to iden­tify an ‘owner’ from amongst all those who can be seen as holding property entitlements? The answer to this question has led to a view of the firm which differs from the pure ‘nexus of contracts’ approach. In modern prop­erty rights theory, the ‘owner’ of an asset possesses ‘residual rights of control’. The word ‘residual’ here, does not refer to the right to receive profit (the residual). It is used as an adjective, not as a noun. It refers to those rights of control which have not explicitly been ceded to others by means of contract. They are residual rights in the sense that they are ‘left over’ and remain with the owner. The most important thing about these ‘residual control rights’ is that they cannot be exhaustively listed and written down. If they could be so listed, they would not be ‘residual’ rights. They would simply be one subset of specified property rights – no different in principle from anyone else’s non-overlapping subset of specified rights which, when summed together, exhaust all possibilities. If every contingency can be con­tracted for, the ‘owner’ disappears. ‘Ownership’ is important because all contingencies cannot be contracted for. When contingencies occur for which there is no contractual provision, it is the ‘owner’ who has the right to determine how an asset should be used.

Ownership is thus intimately associated with contractual incomplete­ness. In subsection 2.1 we saw that ‘private rights’ in housing could be divided between landlord and tenant. What distinguishes these subsets of rights? In principle, the tenant’s rights are delineated in a contract. The landlord retains all those rights which are not explicitly contracted away for a specified length of time. Thus, the landlord is the ‘owner’ of the housing even if he or she cannot live in it for the time being. It might be objected that even the tenant’s rights cannot be exhaustively written down and that interpretation and ‘gap filling’ by the courts will be important. To this, the response is simply that, in their attempts to fill gaps in contracts, the courts are trying to discover what comes within the domain of the contract and what is outside it. Providing there is an acceptance that some matters are not, and were never intended to be, contractually determined (the tenant’s subset of rights is in some sense ‘bounded’), the ‘owner’ with his or her ‘unbounded’ set of residual rights has a distinct and important role.

Recognition of the distinct function of ‘ownership’ has resulted in the development of two rather different (though complementary) types of the­oretical analysis. In the first, associated with the work of Hansmann (1996), the ‘nexus of contracts’ view of the firm is maintained and extended. In the second, associated with the work of Grossman and Hart (1986), attention is directed at the problem of determining the circumstances under which there will be advantages to different assets coming under the control of a single ‘owner’ (integration within the firm) and the circumstances when it will be better to leave the assets under the control of different owners. The ‘firm’ for Grossman and Hart is defined by the common ownership of a set of physical assets, not by the nature of the contracts which bind it to its sup­pliers. In subsections 7.2 and 7.3 we outline these two approaches to ‘own­ership’.

2. Hansmann and the Costs of Ownership

For Hansmann, the firm is the hub of a set of contracts with its ‘patrons’. Patrons are simply any people who find it useful to deal with the firm. They might be workers, providers of capital, consumers of the firm’s output, or suppliers of other inputs such as raw materials, professional services or intermediate goods. Hansmann is not concerned with the nature of these contracts. Whether they are durable and relational or arm’s length and con­ducted on spot markets is not the focus of attention. The main feature of concern to Hansmann is that some patrons will be ‘owners’ and others will not. ‘Owners’ are those patrons who possess ‘residual rights of control’. They are also likely to possess rights to the residual; that is, to the profit. It is usually efficient that control rights and profit rights are held together, although they might, in principle, be held by separate people.

What determines the allocation of ownership rights among the patrons? Efficiency requires that they are assigned to maximise the gains to trade after allowing for all costs of transacting and the costs of ownership. Transactions costs have been discussed in detail in Chapter 2 – the costs of search, bargaining and contractual opportunism. Ownership costs are new. In so far as ownership is associated with claims to profit, one cost is the bearing of Knight’s ‘uncertainty’ – uninsurable risk. The owners have to accept fluctuations in the value of the firm’s ‘equity’, which in the joint- stock enterprise is represented by the market value of shares but in other types of enterprise will not take such a clear-cut form. Another cost given a central role by Hansmann is the cost of exercising residual control rights. Again, it is possible to see this cost as bound up with entrepreneurship. At certain points, where contract is silent, owners determine the use of assets. If we envisage such owners as single individuals, they are entrepreneurs of the variety discussed by Casson – making judgemental decisions. If, as is common, ownership rights are shared between many patrons, they are the group which must determine overall priorities and appoint and monitor the managers and agents who will act on their behalf.

The cost of controlling managers and agents is of central importance in the theory of the firm and is discussed in detail in Chapters 8 and 9. We have already seen in section 5.3.2 that the incentive to monitor in a partnership is undermined as claims to the profit are shared more widely. Hansmann’s distinctive contribution, however, is to emphasise the costs of collective decision-making as a category of ‘ownership cost’. If own­ership is shared between patrons, they must exercise their control rights by means of some ‘collective choice mechanism’. The individual prefer­ences of the owners have to be aggregated and somehow transformed into a decision which can be said to ‘represent’ them as a group. The owner­ship of enterprise is thus closely connected to the problems of ‘public choice’ – not at the level of the state, but at the microeconomic level of the firm. Each firm has a ‘constitution’ which will specify how the ‘owners’ are to exercise their collective rights. Collective choice processes are not costless, however. Each person faces the costs of becoming informed and deciding where their own interests lie. In this function, as in the monitoring function, each owner might try to free-ride on the work of others. They might calculate that their individual influence on the outcome was likely to be so slight that extensive information gathering and analysis would not be individually worthwhile (the problem of ‘ratio­nal ignorance’). They might not even bother to register their preferences and take part in whatever voting processes are used (the problem of ‘rational abstention’). Even where informed owners register their prefer­ences, the choices of the group as a whole might be paradoxical and unre­lated to these individual preferences (the problem of ‘preference aggregation’ or ‘the paradox of voting’).

The simplest and most famous demonstration of the ‘paradox of voting’ is as follows. Suppose there are three voters who must decide collectively which of three available mutually exclusive choices they should make. Voter 1 ranks the options in order of preference ABC. Voter 2 ranks them in the order BCA and voter 3 in the order CAB. If the three people take each pair of options in turn and vote according to a majority rule, they will find that, collectively, they prefer A to B, B to C, and C to A. This is the classic ‘voting cycle’. Assuming the final choice to be made by elimination, the outcome would be arbitrary and depend entirely upon the order in which the options were put to the vote.22 It is not true, of course, that a voting cycle will always be generated. If the rankings of the voters were ‘sufficiently similar’ the problem would not necessarily arise. Collective choice will obviously be a great deal easier in circumstances where there is a fair amount of agreement between the voters. This leads to the conclusion that, where ownership is shared between many patrons, collective decision-making costs will be lower if these patrons comprise a reasonably homogeneous group.

Hansmann argues, therefore, that minimising the costs of the firm’s transactions will imply the allocation of ownership rights to two groups:

  1. to those who would otherwise face relatively high costs of market transacting; and/or
  2. to those for whom the costs of ownership are relatively low.

The first point above derives from the fact that ownership can be a sub­stitute for contract. If post-contractual opportunism is a serious problem, for example, the patrons fearing ‘hold-up’ might become owners. They will now have control and will therefore avoid the dangers involved in ‘contract renegotiation’. Similarly, if another group of patrons faces severe adverse selection or moral hazard problems, ownership might mitigate the difficulty, providing that monitoring costs and other costs of ownership are not too great. In Chapters 10 and 11, we will apply this type of reasoning to the worker-owned firm as well as to consumer cooperatives, mutual enterprises and non-profit firms.

The second point simply emphasises that it is no good avoiding the costs of contract if the costs of ownership are even greater. As will be seen later in Part 2, it is high costs of ownership that lie behind many of the problems of worker-owned firms and non-profits. Workers tend to have heteroge­neous interests unless ownership is confined to a small group such as in a professional partnership. In contrast, a major advantage of the investor- owned joint-stock enterprise, notwithstanding the problem of managerial monitoring which will be discussed in Chapters 8 and 9, is the relatively homogeneous interests of the group of patrons who own it.

3. Grossman and Hart and the Property Rights Theory of the Firm

Hansmann’s approach to the ‘ownership’ of the firm is rooted in transac­tions cost analysis. In contrast, the property rights approach to the firm as developed by Grossman, Hart and others does not focus on transactions costs. It starts by asking the question: why does it matter who holds resid­ual rights of control in an asset? What is the nature of the social benefit derived from an ‘efficient’ rather than an ‘inefficient’ allocation of control rights? Hansmann and other transactions cost theorists would answer that the social benefit takes the form of lower costs of transacting. Property rights theorists argue that this answer is not precise enough. They show this by imagining a world in which bargaining costs are actually zero but in which long-term contracts are incomplete. Some promises are simply ‘non­verifiable’. This does not necessarily mean that behaviour or results are unobservable. It means that it is impossible to construct a contract which a court or some third party can interpret accurately in order to tell whether a promise has or has not been met. One party might promise to publish a book of a particular word length. This would easily be verifiable. Once the nature and quality of the book become important, the terms of the con­tract become ‘non-verifiable’. The manuscript exists, its contents are observable, but is it what the publisher was expecting and had contracted for?

The property rights theory of the firm is based upon the idea that the assignment of residual control rights matters in a world where some issues concerning ‘quality’ are non-contractible. Property rights matter not because they determine the costs of transacting. ‘Quality’ is non-contractible, irre­spective of how property rights are assigned, thus effectively rendering a long-term enforceable contract ‘infinitely costly’. Property rights matter because they influence the ‘power’ that transactors have in their (zero-cost) post-contractual bargaining when the provisions of a contract are non­verifiable. Why, though, should the distribution of such ‘power’ matter to society as a whole? Will not one person’s loss be another person’s gain when this post-contractual bargaining happens? The answer is that people are forward looking. They know that their contractual arrangements are incom­plete and that they cannot rely on third parties to enforce all provisions. They know, therefore, that post-contractual bargaining will take place and that ‘hold-up’ will occur. This certain knowledge that they are vulnerable to hold­up will influence behaviour. In particular, it will influence each party’s will­ingness to make ‘ex ante’ transaction-specific investments.23

As we saw in Chapter 2, transaction-specific investments are those which raise the return to a particular contractual relationship but which have a lower payoff outside. A supplier, for example, may spend time and resources learning how best to satisfy the requirements of a buyer. These are invest­ments in ‘performance’. A buyer may expend resources preparing for the delivery of the supplier’s goods or services. A restaurateur, for example, may invest in publicity and in planning menus prior to the delivery of inte­rior designs. These are investments in ‘reliance’. The essence of the prop­erty rights theory of the firm is that the combined incentive to invest in performance and reliance will depend upon how property rights in physi­cal assets are assigned.

The contractors know that their ex ante investments are non-verifiable. They know that after they have made these investments they will bargain about the distribution of the ex post gains from trade. Each will want to be in as strong a position as possible when this bargaining takes place. A ‘strong’ position in this context is the ability to walk away from the agree­ment with as little penalty as possible. In other words, the person with the most valuable outside opportunities will be in a stronger position than a person who is very ‘dependent’ on the relationship and has few opportuni­ties elsewhere. Each person, therefore, will have a ‘threat point’, a payoff below which he or she cannot be squeezed because they will be better off simply terminating their existing agreement and trading with someone else.

Holding residual control rights in physical assets is important because, in the event of a breakdown in an agreement, such rights might enable alter­native possibilities to be pursued. Consider our example of the restaurant. Imagine that there is a single physical asset – the building itself. Who should be assigned the control rights in the building – the restaurateur (chef) or the interior designer? Each might be expected to make greater ex ante invest­ments if they own the building than if they do not. If the designer owns the building and his or her contract with the restaurateur breaks down, he/she can at least open the restaurant using the services of some other chef hired in the market and so install his or her interior designs. Similarly, if the restaurateur owns the building s/he can, on the collapse of the original agreement, go ahead and open the restaurant even if the interior is not as attractive as s/he had hoped. The crucially important question is therefore the sensitivity of investments in performance and reliance to the possession of residual control rights. The greater the sensitivity of a contractor’s ex ante investments to control of a physical asset, the more powerful the case for the assignment of ownership to that contractor.

Without the availability of the building, the restaurateur will find adver­tising and planning to no avail. Perhaps a different location could be found and a lease arranged, but geographical position and exterior appearance are important especially as these might have featured prominently in mar­keting efforts. Failure of the contract with the building-owning interior designer would therefore be expected to be serious for the restaurateur. He or she might consider the whole investment effectively wasted. Control of the building would be likely to provide a powerful incentive to greater investment on the part of the restaurateur because there is the reassurance that, even without the interior designs, the investment will be productive. Is the position of the interior designer similar?

For the sake of argument, suppose that the designer has other outlets for his or her services. Although the designs for the restaurant are, to some degree, specific to the original restaurateur’s requirements, the ideas can be used, with some not too costly modifications, in restaurants, clubs or pubs elsewhere. This implies that, although his or her effort will be somewhat less if s/he does not own the building, the assignment of ownership to the restaurateur will not greatly diminish the designer’s ‘up front’ commitment. Changing the ownership of the building from the designer to the restaura­teur will radically increase the latter’s ex ante investment and only margin­ally reduce that of the former. Thus, the restaurateur is the most efficient owner.

Another way of expressing this result is to note that, as this story has been set up, the restaurateur’s human capital is complementary to the phys­ical asset to a much greater extent than is the designer’s human capital. Ownership of the physical asset does not help the designer much. S/he needs the chef’s human capital before the project becomes really success­ful. In the absence of the chef’s human capital (that is, if the contract breaks down and the agreement is terminated) the designer gains little from own­ership of the building. The chef’s human capital is thus ‘essential’ in the technical sense that, without it, the designer gains nothing from ownership of the physical asset. If the human capital of one of the parties to a con­tract is ‘essential’, it is this party that should own the asset.

Although the threat to trade elsewhere and renounce a non-verifiable agreement plays a central role in property rights theory, it would be wrong to conclude that agreements are likely to break down. On the contrary, because bargaining costs are assumed to be zero, ex post bargaining is always successful and the ex ante specific investments in human capital undertaken by the contracting parties are never (theoretically) wasted. It is this feature that points up the highly distinctive nature of modern property rights theory. A transactions-cost approach would tend to emphasise the use of governance mechanisms to make the breakdown of contract less likely. The property rights theorising sketched here does not investigate such issues because (perhaps somewhat paradoxically) contracts always survive ex post bargaining. In fact, it is usual to invoke the Nash solution to a cooperative two-person bargaining game which (assuming contractors have similar preferences) predicts that the gains from trade will be shared equally between the bargainers.24 Thus, our contractors not only predict that they will have to engage in ex post bargaining; they also predict what will be the outcome of this ex post bargaining. They each receive their ‘threat value’ plus one half of the gains available from a successful agree­ment. Their ex ante investments are then based upon these accurate fore­casts, and the optimum assignment of ownership rights maximises the resulting total investment in performance and reliance. An algebraic pre­sentation of the theory based upon the work of Hart (1995a) is presented in the notes at the end of this chapter.

Thus far the analysis has focused on the problem of deciding which of two contractors should hold the residual control rights to a single physical asset. If we now imagine that there are two assets involved, further conclu­sions can be drawn about how the rights in these will be distributed. The physical assets, to follow our example further, might be the building and kitchen equipment. It seems natural to expect the restaurateur will control both the building and the kitchen equipment. We would not anticipate the designer owning the kitchen equipment and the restaurateur the building. Why do we not think such an arrangement would be reasonable? The formal answer is once more based upon the willingness of the contractors to make ex ante transaction specific investments. The restaurateur will undoubtedly conclude that the control of the restaurant, in the event of ter­mination of the agreement with the designer, will be of no avail unless he or she also has access to the kitchen equipment. Willingness to invest in reliance is not increased by control of just one of these assets; s/he requires control of both. Given that the designer will not increase his or her invest­ment in performance if s/he controls the kitchen equipment, it is obvious that the chef should control both assets. They can be regarded as a single ‘composite’ asset, to be allocated on the principles discussed above. The two assets in this example are ‘strictly complementary’ and strictly comple­mentary assets should be held together.

In contrast, we might imagine that the second asset consists of some spe­cialised computer equipment used in design work. No one would expect the restaurateur rather than the designer to own this equipment, but can we explain this commonsense response more formally? In the event of the ter­mination of their agreement, the restaurateur gains nothing from his or her control of design equipment. On the other hand, the designer might find it impossible to undertake the modifications necessary to take advantage of alternative opportunities without it. His or her ex ante investment in per­formance will be adversely affected if s/he does not control the design equipment. The chef’s ex ante investment in reliance is not affected either way. Thus, the designer should hold the residual control rights in the design equipment. The two assets are now ‘independent’ and independent assets should be held separately.

Coase’s theory of the firm saw internalisation as a means of reducing the transactions costs of using markets. Given, however, that incentives are still a problem within the firm and that moral hazard and other difficulties do not go away just because a transaction is conducted within the firm rather than outside, the precise source of any advantage to integration is left unspecified in the Coasian analysis. Durable, long-term agreements sup­ported by reputation or indeed supported by a degree of monitoring are possible across ‘markets’ as well as in ‘firms’. As was seen in Chapter 2, Williamson (1985) emphasises the firm as a system of ‘governance’. In a world of specific assets and opportunism, the firm establishes structures for the governance of long-term incomplete contracts. For Williamson, these governance arrangements lower the cost of long-term contracting. Property rights theory also starts from the acceptance of contractual incompleteness, asset specificity and opportunistic post-contractual bar­gaining. The essence of the firm, however, is not that it is an effective gov­ernor of contracts but that it owns a set of physical assets which induce optimal ex ante investment in performance and reliance between contrac­tual agents.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Entrepreneurship and property rights

1. ‘Ownership’ and the Entrepreneur

Property rights theory as developed by Grossman, Hart and others in recent years is in the neoclassical tradition of analysis. It assumes rational utility-maximising behaviour in the presence of a known set of constraints. It is not Walrasian in the sense described in Chapter 1, with its assumption of a full set of markets for all goods and services. In property rights theory, some things are not contractible. Property rights theory is thus in the tra­dition of ‘contested exchange’.25 There are elements to the theory, however, which have implications for the issues which we began to develop in Chapter 3. There we argued that although conceptions of the role of the entrepreneur varied – whether the entrepreneur was an uncertainty bearer (after Knight), an alert intermediator (after Kirzner), a maker of judge­mental decisions (after Casson), or a technological innovator (after Schumpeter) – the one feature in common was that entrepreneurial services were themselves not contractible. You cannot promise to supply entrepre­neurial services because (in the language of property rights theory) such a contract would be unverifiable.

Entrepreneurs achieve pure profits through trading in property rights. These property rights do not have to be residual control rights. Any exchangeable right is capable of generating pure profit to the entrepreneur. From Kirzner’s point of view, for example, a person who buys a lease on some domestic property and then sells to a person who values it more highly gains an entrepreneurial rent. The fact that ‘ownership’ of the prop­erty has not changed hands does not matter; the entrepreneur might or might not be the ‘owner’ of the property. However, although ownership is not a necessary condition for the exercise of the entrepreneurial function, we might still wonder whether ownership is sometimes helpful to the entre­preneur.

The property rights theory reviewed in section 7.3 hinges on the idea that ‘ownership’ matters when contract fails. Consider now the position of an entrepreneur wishing to undertake a particular plan of action. If the entre­preneur owns the physical assets which are necessary to the successful com­pletion of the plan, there is no need to ask anyone else’s permission to go ahead. The danger with having to negotiate with an owner is that the owner will then be able to demand a share in the entrepreneurial profit. Does this matter? Somewhat paradoxically, an Austrian theorist such as Kirzner would find it difficult to prove that it did matter. This is because Kirzner asserts that entrepreneurial insight is ‘costless’. Combine Kirzner’s costless alertness with Hart’s costless bargaining and the sharing of some of the profit with an asset owner will not reduce the amount of entrepreneurial alertness or the social gains achieved.26 Relaxing these assumptions of cost­less alertness and bargaining, however, makes the assignment of ownership rights important for entrepreneurial activity.

If bargaining is not a costless and efficient process, ownership of resid­ual control rights by the entrepreneur will economise on bargaining costs and greatly facilitate entrepreneurial activity. This is an important Austrian’ theme which is taken up in Part 3 where the social importance of the assignment of control rights to ‘competence’ is discussed in more detail. Similarly, if entrepreneurial ideas require the commitment of time, energy and resources in ex ante non-contractible investments, the Grossman-Hart property rights approach can be applied directly to conclude that the nec­essary physical assets should be owned by the entrepreneur whose human capital is likely to be ‘essential’ to achieving the best results. This would be so, even if the cost of bargaining with the asset owner were zero ex post. The entrepreneur would predict that a portion of the value of his or her dis­coveries would be appropriated by the asset owner and would thus cut back on the level of ex ante investment.

A major reason for the lack of dynamism associated with socialist regimes during the twentieth century was the difficulty faced by most people in putting any entrepreneurial plan into action, because rights of control were not assigned privately. These control rights were also separated from profit rights. In fact, profit rights and control rights are highly complemen­tary. If I possess profit rights but no control rights, I will expect to have to share some of my profit with the controller before I am permitted to go ahead with an entrepreneurial initiative. Conversely, if I have control rights but no profit rights, I will expect much of the value of my entrepreneurial activities to be harvested by the holder of profit rights. Holding one of these rights is not enough. I need to hold both together before the return to my non-contractible investments is protected. The close association between profit rights and control rights – the fact that these are usually seen as a ‘package’ constituting ‘ownership’ – thus follows naturally from property rights analysis.

2. The Property Rights Approach to the Finance of the Entrepreneur

In Chapter 3 we introduced Kirzner’s (1979, p. 94) idea that entrepreneur­ial profits are captured by those ‘who exercise pure entrepreneurship, for which ownership is never a condition’. By this statement, Kirzner is empha­sising the point that entrepreneurial gains are a distinct category and nothing to do with the conventional return to capital. Precise textual analy­sis of Kirzner’s treatise does not concern us here, but it will be useful to interpret his use of the word ‘ownership’ at this point as meaning simply ‘wealth’. Kirzner is not making distinctions between ownership and other property rights but is simply making the challenging point that you do not have to be wealthy to be an entrepreneur. He notes that a capitalist might be induced to lend funds at interest and that this might permit pure profit to be earned without any net investment on the part of the entrepreneur.

Property rights theory provides a tool for investigating this claim in more detail. Obviously there is a level at which the possibility of successful entre­preneurship with zero wealth is a matter of pure tautology rather than pos­itive economics. Maybe there really are capitalists somewhere who might be induced to lend to entrepreneurs with zero wealth. Possibly the entrepre­neur has discovered an opportunity that can be pursued by means that are not inherently non-contractible. Nevertheless, the commonsense observa­tion that it is less difficult to act entrepreneurially if you have some wealth than if you do not can be analysed more systematically and is capable of yielding insights concerning the use of debt in financial contracting.27

Consider, for example, an entrepreneur who has no wealth (for the sake of convenience we shall take this entrepreneur to be a woman). She dis­covers an opportunity that will result in a flow of revenue in the future. The revenue flow will be R1 at the end of period 1 and R2 at the end of period 2. There is no uncertainty in this model which gives it a rather Kirznerian flavour. In order to achieve this revenue flow, the entrepreneur must invest in an asset costing K0 at the beginning of period 1. The asset depreciates over the two periods. At the end of period 2 it is worthless. Assume that the rate of interest is zero so that this project is worth undertaking if

Ko< R + R2                                                  (4.1)

If there are no contractual difficulties confronting capitalist and entrepre­neur, we can imagine the entrepreneur borrowing K0 from the capitalist. The entrepreneur could make repayments from the revenue stream spread over the life of the project. The pure profit might be split between capital­ist and entrepreneur.

In fact, of course, the entrepreneur and the capitalist face a very haz­ardous contractual environment. These hazards can be introduced by assuming that R1 and R2 are unverifiable. It is always possible for the entre­preneur to claim that she had not, after all, received any revenues and that the project had failed. She could regretfully announce this news even as she is making secret deposits in a foreign bank account. This is an extreme sit­uation. It is not denied that social mechanisms exist to help enforce debt repayments and overcome hazards. If the project is a local one, and if the lenders of funds are neighbours, the power of peer pressure and other mon­itoring devices might be used. These matters will be considered in Chapters 10 and 11. The assumption of the non-verifiability of revenue flows, however, helps to focus attention on the root of the contractual problem and is a useful basis for modelling potential responses.

The first thing to note is that, with non-contractible flows of revenue, the project is unlikely to get financed if the entrepreneur has no wealth. There is, however, one possible escape route. If the asset does not depreciate at all during period 1, the entrepreneur could make the following offer to the cap­italist. ‘If you will lend me a sum K0, I will undertake to repay this sum at the end of period 1. If I fail to make this repayment, you may take over the ownership rights in the assets which will have a liquidation value at that time of K0.’ This has the features of a classic debt contract. Ownership of the asset resides with the entrepreneur for so long as an agreed schedule of repayments is adhered to. Failure to meet this schedule of payments will lead to ownership rights passing to the capitalist. Clearly, if K0 is used to acquire assets these will normally depreciate so that the entrepreneur’s offer mentioned above will not generally be possible. Perhaps the nearest approach in practice would be the finance of a farmer by a mortgage on the land. Nevertheless, the main point is that the entrepreneur can gain access to debt finance providing she has wealth enough to cover the depreciation on the asset during period 1. If the asset has a liquidation value of K1 at the end of period 1, the entrepreneur will be able to borrow K1 during that period of time. She will therefore require wealth of K0 — K1 to start her project. We will assume initially that the entrepreneur uses all her available wealth ( W) to get her project off the ground (W =K0 — Kl).

Starting the project is all very well, but will she be able to finish it? It is not obvious that it will be worth borrowing K1 and investing her personal wealth K0 — K1, in the certain knowledge that liquidation will occur at the end of period 1. How will she make the payment Kj at that point? One answer might be that the revenue Rj is more than enough to pay off the debt; that is, Rj > Kj. The project would certainly have to be extremely profitable for this to be the case but it is a logical possibility. Notice that, if the entrepreneur is not to liquidate the project herself, the revenue at the end of period 2 must also exceed Kj. Thus we assume R2 >K1 to ensure that liquidation of the project at the end of period 1 is not the efficient thing to do in any case. We conclude that there might exist some projects with such attractive cash flows that the entrepreneur will be able to use them to finance her borrowing. The capitalist will have security for his loan and the entrepreneur the means and the incentive to repay at the end of period 1. Note also that, for a given level of investment K0, the smaller is the depre­ciation of the assets during period 1 the more the entrepreneur can borrow, but the greater the cash flows have to be at the end of periods 1 and 2 to enable and to warrant the repayment of debt. Entrepreneurs with very modest wealth will therefore only be able to finance projects using assets which depreciate slowly at first, which are not highly project-specific and which have cash flows which imply a very high level of profitability.

There is, however, another possibility. In the above paragraph, the implicit assumption was made that the full project either achieved com­pletion at the end of period 2 or was liquidated at the end of period 1. Perhaps, however, the entrepreneur could herself partially liquidate the project at the end of period 1 using the proceeds of asset sales to repay her loan to the capitalist – thus retaining control rights in the assets that remain. Even with R1 < K1, therefore, the entrepreneur need not give up her plans for period 2, pocket R1, go into liquidation and hand over all control rights to the capitalist. She could borrow K1 and repay partly from period 1 revenues but partly from asset sales. This would only be possible, of course, if the project were flexible enough to be scaled back. In order to meet her obligations to the capitalist she would have to sell a proportion (K1 — R1)/K1 of the assets at the end of period 1. This would leave her in possession of R1/K1 of the assets from which we assume she could derive a revenue of (R1/K1)R2 at the end of period 2. The entrepreneur therefore invests K0— K1 (still assumed to be equal to her personal wealth) at the beginning of period 1 and receives the return (R1/K1)R2 at the end of period 2. This will be profitable providing that

Since K1 =R1 + (1 — R1IK1)K1, this condition for the entrepreneur to invest can also be written

The last two terms on the right hand side of 4.1′ constitute a weighted average of K1 and R2. Since K1 < R2 we deduce that

In other words, the entrepreneur will not undertake all projects for which the revenues exceed K0. She will undertake only those projects for which the cash flows (allowing for partial liquidation at the end of period 1) exceed K0. Clearly if R1 = K1, the entrepreneur will be able to undertake the project with no inefficient early liquidation, just as discussed in an earlier paragraph.

Up to this point, the entrepreneur’s personal wealth has served entirely to finance the depreciation of the asset during period 1. If she does not have access to this level of wealth, she cannot get the project through the first period. This is because, by assumption, she cannot commit to pay the capi­talist more than the liquidation value of the asset at the end of period 1.28 If, however, her wealth is more than sufficient to cover asset depreciation during the first period she might use the remaining portion, in conjunction with revenue accruing at the end of period 1, to repay the capitalist and avoid some of the inefficient liquidation which might otherwise be necessary. This partial liquidation of assets occurs because she cannot commit to pay the capitalist any of the non-verifiable revenue accruing at the end of period 2.

An entrepreneur will be able to finance a project without partial liquida­tion of assets along the way providing her wealth ( W) exceeds asset depre­ciation in the first period (K0 — K1) plus any shortfall in period 1’s revenue below the liquidation value of the assets (K1 — Rl). Thus:

ensures that the entrepreneur’s plan can be financed without liquidating assets. If wealth exceeds K0—K1 but is less than K0 — Rl the proportion of the project liquidated at the end of period 1 will be (K0 — Rl — W)IK1.

Private wealth is thus important to entrepreneurial initiative in a prop­erty rights analysis. The contract of debt whereby control rights pass to the capitalist in the event of default is indeed a method whereby entrepreneurs can pursue their ideas without having to finance it all themselves. But some private wealth assists entrepreneurship. The greater the depreciation of assets in period 1 (perhaps the more project specific the capital) the more private wealth is required to finance the project. Similarly, the more end- weighted the project returns (the smaller the revenues accruing at the end of period 1) the more private wealth will be required to avoid partial liqui­dation of assets.

The role of private wealth in economic development is of considerable theoretical and practical interest. In a recent book, for example, de Soto (2000) asks why it is that capitalism seems to be successful in some coun­tries but apparently fails in others. His answer is that even people living in conditions of great poverty have accumulated in aggregate vast resources which eclipse flows of aid and even far exceed the capitalisation of compa­nies quoted on the local stock exchanges. His main point is that these assets held by the poor constitute ‘dead capital’ because legal title to the assets is absent or hugely expensive to assert. He estimates untitled real estate hold­ings in Haiti to be worth $5.2 billion, in Peru $74 billion, in the Philippines $133 billion, in Egypt $240 billion and, in the third world as a whole, $9.3 trillion.29 Without legal title, the potential energy contained within these assets cannot be realised. ‘What creates capital in the West, in other words, is an implicit process buried in the intricacies of its formal property systems’ (p.39).

In the context of this chapter, we might interpret de Soto’s argument along the following lines. The existence of legal title and the ability to trade ownership rights cheaply and freely is a determinant of the level of per­sonal wealth that people can bring to bear on their entrepreneurial projects. People in many countries live in poverty unable to exercise entrepreneurial talent because the assets which they have developed, which are part of their everyday existence and whose value is collectively enormous, cannot be used in the type of bargains that we have been discussing in section 8. Untitled assets might be valued by ‘think-tanks’ at $9.3 trillion throughout the world, but, for the mass of people, W is effectively zero.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Principal and agent in the firm

1. INTRODUCTION

In chapters 1 to 4, attention has been focused on the difficulties inherent in formulating agreements which permit specialisation and exchange to occur, when information is not ‘public’; that is, costlessly and equally available to everyone. Specialisation permits people to concentrate on those tasks in which they have a comparative advantage (Chapter 1), and it also has more dynamic effects. As Adam Smith1 noted, specialisation results in the acqui­sition of enhanced levels of skill in particular operations as experience accumulates, it may permit the introduction of specialised machinery as various activities are broken down into basic components, and it may reduce the time which would otherwise be spent transferring attention from one job to the next.2 If ‘many hands make light work’ however, they may also give rise to new problems. A person who ‘saunters a little’ between tasks will at least face the costs of that sauntering if he or she is responsi­ble for all stages in the production process. Much of Chapter 4 was devoted to the analysis of situations in which specialisation results in Adam Smith’s ‘sauntering’ turning into Alchian and Demsetz’s ‘shirking’ as people attempt to transfer the costs of their wavering attention on to others. Clearly, Smith, in his example of pin making, was considering a case of division of labour which, we may infer, he did not expect would encounter substantial coordination and policing costs. In general, however, costs of coordination and policing are not negligible (Chapter 2) and the firm itself can be seen as a response to them. These costs underlie the saying that ‘too many cooks spoil the broth’, as well as the often heard remark ‘if you want a job done properly you must do it yourself’.

The tension between the advantages of ‘specialisation’ and the costs of policing and monitoring (the advantages of ‘integration’) is a leitmotif which returns constantly in the theory of the firm. Already in Chapter 4 we have seen how different types of firm represent compromise solutions to the conflicting requirements of specialisation and incentives. In the classical proprietorship, the tasks of capitalist, monitor and risk bearer are ‘inte­grated’ and the problem of managerial incentives thereby mitigated. The partnership and, even more, the joint-stock company permit the ‘disinte­gration’ of these functions and hence the potential advantages which may accrue from exchange – a specialised management and widely spread risks – but the problem of policing and incentives is more pronounced.

In this chapter our objective is to consider the contractual problem faced by principal and agent in greater analytical detail. This will help to clarify the nature of the trade-off between risk-sharing benefits and effort incen­tives, and will provide a useful framework for rationalising various con­tractual arrangements which are observed in practice. At the outset, it is necessary to remember that the economist uses the word ‘agent’ in a much looser sense than does the lawyer. For the lawyer, an agent is ‘a person invested with a legal power to alter the principal’s legal relations with third parties’.3 Thus two partners are one another’s agents in a strict legal sense, since each can bind the other in contractual arrangements with third parties. The economist, however, is, for once, more in line with common usage in seeing the agent as a person who is employed to undertake some activity on behalf of someone else (the principal). This will cover cases of agency in the strict sense, but will also include other cases in which the same or similar incentive problems arise, although the legal term ‘agent’ would not be accurate.4

2. OBSERVABILITY AND THE SHARECROPPER

As we have seen in chapter 2, a principal-agent relation exists when one party (the agent) agrees to act on behalf of another party (the principal). (For the sake of convenience we will take the principal agent, and workers in this chapter to be male.) The problem then exists of devising a ‘contract’ which provides incentives for the agent to work in ways which benefit the principal. Let π be the final outcome of the agent’s activities, and let e rep­resent his level of effort. Sharecropping represents the most commonly used illustrative case for discussing incentive contracts, with the landlord as the ‘principal’ and the sharecropper as his ‘agent’ (in the loose rather than the legal sense of these terms). In this case, π might represent the volume of the crop finally harvested (say bushels of wheat), while e would represent the input of the sharecropper’s time and skill. Now suppose that the final outcome is deterministically related to the sharecropper’s effort

Assuming that both sharecropper and landlord can observe the final outcome and that this outcome is conceptually simple enough to appear in a contract, it is clear that there is no reason for the landlord to monitor effort. The contract merely has to stipulate the outcome desired (say π ), and the payment to be made when it is achieved. Under these circumstances, perfect knowledge of the outcome gives us perfect information about the effort expended, and the result is a contract which is not a ‘sharecropping’ arrangement at all but simply a paid worker contract. The worker receives a specified reward upon completing the job he was hired to do. There is no incentive problem.

A case more germane to the problem of incentives occurs when the outcome depends not only on the labourer’s effort but also upon other chance factors. In the agricultural example, the harvest may depend upon climatic conditions as well as work effort. Thus we might write

where θ represents the ‘state of the world’. For example, θ  might measure ‘inches of rainfall’ or ‘hours of sunshine’ or ‘average temperature in July’ and so forth. Any contract will now involve the sharing of risk in addition to the provision of incentives. Consider, once more, the paid worker con­tract which depends only on the achievement of a target outcome π. The labourer is unlikely to accept such an arrangement since it exposes him to considerable risk. The most careful husbandry could be powerless against adverse weather conditions and the labourer, if he is risk averse, will prefer that some account is taken either of his effort level, or of the state of the world prevailing. Clearly, however, the prospect of including e or θ in a con­tract depends upon whether or not they are ‘observable’. Suppose, first of all, that 0 can easily be verified by both parties. It follows, once more, that observation of the labourer’s effort is unnecessary to provide incentives, and that a preferred distribution of risk can be achieved without con­fronting the shirking problem.

From (5.2) both landlord and labourer will be able to work out the result (π) of a given amount of effort (e) in different states of the world (θ). The labourer’s reward (A) can therefore be made to depend upon both outcome and state of the world: A = A(π,θ). If the landlord receives an amount (P) which depends only on the state of the world P(θ), the labourer’s return would be given by

In other words, the labourer would pay to the landlord an amount P(θ) which depended on the weather, and would keep the rest of the harvest for himself. The results of additional effort always accrue to the labourer, so that no incentive problem arises, while the characteristics of P(θ) enable risks to be shared between landlord and labourer in any way desired. Such arrangements typify the ‘sharecropping’ contract. If, for example, P(θ) were a constant P’, so that the labourer paid the same amount to the landlord in every state of the world, the labourer would effectively be bearing the entire risk and insuring the landlord against the vagaries of the weather. On the other hand, the landlord’s share P(θ) could be so arranged that the remain­der left for the labourer is always the same, providing the labourer puts in the standard effort e’. In this case, it would be the landlord who would bear the risk and the labourer who would receive a definite predetermined return providing the standard effort e’ was forthcoming. If both landlord and labourer were risk averse, we would not expect either to bear the entire risk. Instead P(θ) would be defined so as to share risk efficiently between them. In the next section we will discuss in more detail what it means to share risk ‘efficiently’.

Specifying a mutually agreeable contract becomes more complicated when we assume that the state of the world θ is not observable by the land­lord (or principal). If effort e is unobservable also, then clearly any contract must of necessity depend upon the outcome π alone. It is this case which we will discuss in detail in section 4. Even at this stage, however, the essen­tial character of the problem can be appreciated. The unobservability of the state variable θ and effort e means that it will, in general, be impossible to achieve an ideal distribution of risk between the parties without sacrificing effort incentives. Conversely, the ‘best’ contract achievable will usually involve the sacrifice of risk-sharing benefits in the interests of pro­viding incentives.

Consider the case in which the landlord or principal is risk neutral and the labourer or agent is risk averse. Intuitively, we have already accepted that the risk-neutral partner should ideally bear the entire risk (a proposi­tion which we will justify in more detail in the next section). Where the state of the world 0 is observable, it has already been shown how this distribu­tion of risk could be arranged whilst still eliciting the standard effort e’ from the labourer. The labourer would receive the same reward whatever the state of the world, but only if he exerted the standard effort. Where the contract must depend on the outcome alone, however, assuring the labourer of a given reward is to leave him with no incentive to do anything. If the principal can never form the faintest conception of how hard the agent worked, and can never become acquainted with the difficulties he encoun­tered, these factors cannot enter the contract. In such a case, however, a promise to pay a fixed determinate sum to the agent would be to make his remuneration completely independent of effort. The labourer would receive the same number of bushels of wheat from the landlord even if his only acquaintance with the fields to be cultivated occurred as he passed through them during his daily journeys to and from the village pub. Clearly, the provision of effort incentives requires that the labourer receives a bigger payment if the harvest is big than if the harvest is small, but because the harvest depends on the chance factors θ, this implies that the labourer must shoulder some risk, even though his landlord is risk neutral and would, in conditions of ‘observability’, provide him with complete insurance.

The above paragraph suggests that information about the labourer’s effort would be valuable in enabling both parties to a contract to achieve preferred positions. If we continue to assume that θ is unobservable by the principal, some observation of the agent’s effort e might clearly benefit both parties. By checking that the labourer sowed the correct type of seed, applied the appropriate fertiliser and so forth, it would be possible to move towards the ideal distribution of risk without diminishing effort incentives. In the extreme case of perfectly observable effort, the (risk-averse) labourer would receive a payment dependent entirely on effort and would thus face no risk, while the (risk-neutral) landlord would receive the residual harvest. Monitoring will not usually be so reliable, however, and a further interest­ing question is whether information about effort containing errors, which are subject to a known statistical distribution, could be incorporated in a contract to the advantage of both parties. The landlord might check at random whether the labourer is actually working in the field. An unlucky labourer could find that the spot check occurred during the only five minutes he was away, and a lucky one that it occurred during the only five minutes he was there. A series of such checks will provide information which, although not perfectly accurate, nevertheless contains potentially usable information about effort. Indeed it can be shown,5 that, irrespective of the ‘noise’ associated with the information, there will always be an advantage to incorporating an informative signal into a contract if the agent is risk averse (assuming that the signal is costlessly received and that costs of writing the contract can be ignored). A more detailed illustration of what it means for a noisy signal to be ‘informative’ will be given in section 4.

In the case of a risk-neutral labourer or agent, information about effort or state of the world will be valueless. Even in conditions of complete non­observability of effort and state, all available risk-sharing benefits can be achieved without sacrificing incentives. As we noted above, a risk-neutral labourer will shoulder the entire risk and pay a fixed fee to the landlord. Because the fee is the same whatever the state of the world that occurs, it is clearly not necessary to observe the state. Neither will observation of the agent’s effort confer any benefits, since the agent will personally face the consequences of any ‘sauntering’ and cannot unload the costs on to the landlord. In effect, the agent simply pays a fixed fee for the use of the resources owned by the landlord for a specified period of time. This will obviously avoid any dangers associated with shirking, while the risk- neutrality assumption ensures that this arrangement is compatible with the efficient distribution of risk. The next two sections illustrate some of the above results using a set of simple diagrams.

3. RISK SHARING

Consider a case in which two people (person A and person P) wish to share the risk implied by a fluctuating harvest. Suppose that the harvest or outcome can take only two values π1 and   π2  with π1 > π2. For the present,

we ignore the influence of effort on the harvest and simply assume that π1 and π2 occur with probabilities p1 and P2 = (1—Pj) respectively. Each person, we assume, will be entitled to a given portion of the harvest depend­ing on whether the harvest is good or bad. Thus,

where π1A is person A’s entitlement when the harvest is good, π2P is person P’s entitlement when the harvest is bad, and so forth. Presented with various different combinations of π1A and π2A, it is assumed, as in elemen­tary consumer theory, that person A is capable of ranking them in a weak ordering. The outcome of this ranking process can then be illustrated on a two-dimensional diagram using indifference curves. If person A’s prefer­ences accord with certain axioms – the von Neumann-Morgenstern axioms6 – it can be shown that a utility function for person A can be con­structed UaGa) such that his preferences over risky prospects are consis­tent with the expected utility attached to the prospects. Indifference curves can then be thought of as lines of constant expected utility.7

The shape of A’s indifference curves will depend upon his attitude to risk. In Figure 5.1, each point G1A, π2A) represents a prospect. At point ‘a’ for example, person A has a claim or entitlement to π¡A if the harvest is good andπ2A if the harvest is bad. Since, as drawn, π¡A = π2A, this implies that, at such a point, person A would be certain of the outcome and would be bearing no risk. Given the probability of a good harvest P1, it is clear that any portfolio of claims represented by a point in the shaded set to the north-east of ‘a’ will be more preferred, and any portfolio repre­sented by a point in the shaded set to the south-west of ‘a’ will be less pre­ferred, than the portfolio at ‘a’. Thus, just as in conventional consumer theory, indifference curves are expected to slope downwards from left to right. The curvature properties of the indifference curves are more complex, however.

Consider the straight line p1π1A + p2A = E drawn through point ‘a’. By definition, all prospects along this line produce the same mathematical expectation (E) of the outcome as at point ‘a’. The slope of the line will be —(pj/p2) . Now consider a point such as b on this line. Will person A prefer ‘b’ to ‘a’ or vice versa? To answer this question, we note that a move from a to b implies a move from a riskless environment to a risky one. Person A would risk losing (k[a — ^j’A) in the event of the harvest being good, but would stand the chance of gaining (πA — π2A) in the event of the harvest being bad. However, since the expected outcome for person A is constant all along the straight line, we know that the gamble involved in moving from ‘a’ to ‘b’ is a ‘fair’ gamble. A ‘fair’ gamble is one with an expected value of zero. Person A would expect, in a mathematical sense, neither to gain nor to lose because

Whether person A would prefer point a or point b therefore reduces to the question of whether or not person A is prepared to take a ‘fair’ gamble. Any person who always rejects a ‘fair’ bet will prefer a to b. Such a person is ‘risk averse’. A risk-averse person will move to lower and lower levels of satisfaction (expected utility) as he or she moves away from point a in either direction along the line of constant expected outcome. Thus, the indifference curves of a risk-averse person will have the conventional convex shape familiar from elementary consumer theory. Figure 5.2 illus­trates the preference map of a typical risk-averse person. An important characteristic of this preference map is that the slope of the indifference curves along the 45° line from the origin (for example, at point a) will be equal to the slope of the constant expected outcome line —(p/p2) . In other words, along the certainty line where ^1A = ^2A, each person will be pre­pared to exchange claims contingent upon a bad harvest for claims contin­gent on a good harvest in the ratio (p1/p2). This applies only to ‘points such as a’ and is therefore a proposition about limits. Although the person is risk averse, he will approach indifference between point a and a fair gamble, as the ‘stakes’ become vanishingly small. Some use will be made of this prop­erty of indifference curves in future sections.

Returning to Figure 5.1, a person who is indifferent between point a and any ‘fair’ gamble represented by another point on the constant expected outcome line (such as point b or point c) is termed ‘risk neutral’. Variability of the outcome is of no consequence for such a person. The only matter of interest is the expected value of the outcome, and all combinations of claims which yield the same mathematical expectation of the outcome are equally preferred. Thus, the indifference curves of a risk-neutral person will be straight lines with slope — (pj/p2) corresponding to lines of constant expected outcome. Maximising expected utility for this person will be the same as maximising the expected outcome. For completeness, we should add that a person who enjoys taking fair bets will prefer point b to point a, and hence a ‘risk-preferring’ person will have concave indifference curves. The case is not illustrated and no future use will be made of it.

We are now in a position to discuss the risk-sharing problem. Figure 5.3 is a ‘box diagram’ with A’s origin at the bottom left-hand corner and P’s origin at the top right-hand corner. The horizontal dimension of the box represents the total harvest if yields are good (nq) and the vertical dimen­sion represents the total harvest if the yields are bad (^2) . Any point within the box represents a division of the total harvest between the two people in both good times and bad. Thus, point r, for example, illustrates a case in which person A has entitlements given by distance OA^JA if the harvest is good and OA^’2A if the harvest is bad; while person P has entitlements given by distance OP^P if the harvest is good and OPrn’2P if the harvest is bad. Person A’s claims plus person P’s claims sum to the total harvest.

The question now arises: does point r represent an ‘efficient’ allocation of claims between persons A and P? The answer will depend upon the pref­erences of the people concerned and therefore Figure 5.3 illustrates one particular case. As drawn, person A is risk averse and his convex indifference curves are drawn with respect to an origin at OA; while person P is assumed to be risk neutral and his straight line indifference curves are drawn with respect to an origin at OP. At point r, person A has utility index UA and person P has utility index Up. It is clear, however, that by exchang­ing claims with one another, both could be made better off; there are gains from trade to be had. Any allocation of claims represented by a point in the shaded set in Figure 5.3 will benefit at least one of the parties without harming the other; that is, they represent ‘Pareto improvements’ on point r. ‘Efficiency’ is characterised by the absence of gains from trade, as was seen in Chapter J of this book. Points of tangency between A’s indifference curves and P’s indifference curves will be efficient points. An allocation rep­resented by a point such as s, for example, is ‘efficient’. Any move away from s must harm one or both of the parties, and therefore agreement to a move will be impossible to achieve.

As drawn in Figure 5.3, the locus of points of tangency between the indifference curves of the two parties lies along the 45° line out of A’s origin. It will be recalled that P’s indifference curves have a slope of (pj/p2) along their entire length, whilst A’s curves have a slope of (p/p2) along A’s certainty line. Thus, tangency must occur along A’s 45° line. The line between s and t represents the set of efficient allocations which are Pareto improvements on point r. By moving from point r to point t, for example, person P will be supplying person A with fair insurance. Fair insurance will benefit risk-averse A, whose utility index increases to UA. The risk taken by person P will be increased, but, for him, point t represents a fair gamble rel­ative to point r and, being risk neutral, person P’s utility index will be the same at t as at r. A move from r to s, on the other hand, will confer all the gains from trade on person P. Person P shoulders the entire risk still, but now on somewhat ‘unfair’ terms (in an actuarial sense) relative to point r. P’s expected return and hence expected utility has increased, while A’s expected return has decreased. The fall in the expected return to A does not result in a fall in expected utility because the greater certainty of the return at s compared with r is sufficient to compensate.

Figure 5.3 illustrates our earlier contention that efficient sharing of risk will involve a risk-neutral party providing complete insurance to the risk­averse party. Thus, in section 2, we saw that, where the state of the world 0 was observable and the effort-incentive problem could therefore be over­come, a risk-neutral landlord (employer) would take the entire risk and a risk-averse labourer (worker) would receive the same amount irrespective of whether the harvest turned out to be good or bad. Conversely, a risk­neutral labourer would pay a fixed fee to a risk-averse landlord and keep the residual harvest. Where both persons are risk averse, efficient alloca­tions of claims will be between the two 45° lines in Figure 5.3 and both parties will bear some of the risk.

4. EFFORT INCENTIVES

The principal-agent problem proper can now be considered by assuming that the probability of a good harvest is not given and unalterable, but can be influenced by the activity of the agent, person A. To keep matters as simple as it is possible to make them, let us assume that by exerting effort e, the agent is capable of changing the probability of a good harvest from pj topje wherepje>pj. The agent has a simple choice between two levels of effort, zero or e. His effort, however, is assumed to be totally unobservable by the principal, P. The final outcome or harvest can be observed by both parties, but this is all. Let the agent be risk averse and the principal be risk neutral.

Consider now the effect of effort on the agent’s indifference map. Because effort increases the probability of a good harvest to pje, the slope of all the agent’s indifference curves along A’s certainty line will steepen to —pj e/(J – pje). At any particular point on this certainty line, however, the utility index will be lower, because effort we assume is unpleasant and reduces A’s level of utility. In Figure 5.4 the indifference curve of the agent through 0 is drawn assuming no effort is exerted. The slope at 0 is — pj/(j — pj) and the utility index is UA. The curve through a is drawn on the assumption that effort eis exerted. Its slope at a is (— pje/(l — pje)) and the utility index is also UA. To show that it applies to situations where effort is being exerted, the curve is labelled UAe. Distance 0a is a measure of the ‘cost’ to A of effort e.

These two curves UAe and UA intersect at point r. A portfolio of claims represented by point r would just leave the agent indifferent between exert­ing effort e and not exerting any effort. At any point to the right of r and between UA and UAe, the agent would strictly prefer effort to no effort. At point s, for example, the agent will achieve a higher utility index by oper­ating on his ‘with effort’ set of indifference curves than his ‘without effort’ set. By providing the agent with a portfolio of claims at s, we can induce the effort e. No monitoring is possible and none is required. The agent’s own self-interest will be sufficient to produce the effort. We have loaded his claims sufficiently heavily in favour of the good harvest that he has an interest in increasing the probability of this favoured event occurring; an interest powerful enough at s to overcome the disutility associated with effort.

By an identical process of reasoning, we can deduce that at point r’ in Figure 5.4, the agent is also indifferent between effort and no effort. At this point, the agent’s utility index is UA> UA, whether or not effort is forth­coming. Joining up the points of intersection between ‘with effort’ and ‘without effort’ indifference curves applying to the same utility index, a locus such as rr’ is traced out. Points to the right of rr’ will induce effort e. Points to the left will not.

If it is possible to induce effort e from the agent, we still do not know whether both parties would agree to such a contract, or whether the prin­cipal would be as well off leaving the agent to relax in security at a point along his certainty line. It is this question which Figure 5.5 attempts to answer. At a point such as 0, risk would be efficiently shared between persons A and P as was shown in section 3. For effort to be forthcoming from the agent, however, a contract to the right of rr’ must be agreed. Such a contract, by increasing the probability of a good harvest topxe, will affect P’s indifference curves and not merely person A’s. P’s indifference curves will now have a slope of -pje/(1 -pxe) along their entire length. The new P indifference curve yielding the same utility index as at 0, but applying to a situation in which the agent is exerting effort, is labelled UpA. Note that it cuts P’s original indifference curve UP at π along P’s certainty line. Clearly a given point on P’s certainty line will yield the same utility index irrespec­tive of the probability of a good harvest since, along that line, P is com­pletely insulated from the effects of variations in the harvest.

Inspection of Figure 5.5 reveals that there exists a set of contracts (the shaded set wxy) which consists of Pareto improvements on the contract at 0. Points in the set wxy are between the curves UAe and UpA, thus ensuring that the utility index of both principal and agent will be at least as great as at 0, and to the right of rr’, thus ensuring that it is the ‘with effort’ indifference curves that will be relevant and that effort e will be forthcom­ing from the agent. Of this set of Pareto improvements on 0, Pareto- efficient contracts will lie along the boundary between x and w. A move from point w, for example, to any other point in the shaded set will harm person A. A move from point x would harm person P. Conversely, from any point within the shaded set, it will be possible to find a point on the bound­ary which is preferred by both A and P.

It is important to notice that along the boundary xw, risk is not shared efficiently between principal and agent. The indifference curves of princi­pal and agent intersect (for example, at point w) indicating that ideally there are risk-sharing benefits to be achieved by a move to A’s certainty line between s and t. These benefits are unachievable, however, because of the observability problem. Figure 5.5 therefore illustrates clearly the distinction drawn in principal-agent theory between ‘first-best’ solutions (achievable only in an ideal world of perfect observability) which lie along A’s certainty line and involve the efficient sharing of risk, and ‘Pareto-efficient contracts’ which lie along xw and are the best that can be achieved in the context of unobservable effort and state. Along xw, risk-sharing benefits are sacrificed in the interests of providing incentives. The sacrifice will be worthwhile, providing that the agent’s effort is not too costly to him for any given effect on the probability of a good harvest, or, conversely, that for any given level of the disutility of effort the effect on the probability of a good harvest is sufficiently pronounced. It is possible to envisage a case in which the dis­tance 0a in Figure 5.5 is so large and (pxe -px) is so small that the set of Pareto improvements on 0 is empty.

Although a risk-averse agent may be made to bear some risk in the efficient contract, it is worth noting that we will never observe him bearing the entire risk. For a risk-averse agent to bear the entire risk we would have to imagine an efficient contract existing somewhere along P’s certainty line. In terms of Figure 5.5, the locus xw would have to cross P’s certainty line at some point. Risk aversion and the resulting convexity of A’s indifference curves ensure, however, that the point x must always lie to the right of UP. But points to the right of UP along P’s certainty line will leave person P with a lower utility index than at 0. Thus, there can never be an efficient contract on P’s certainty line which is Pareto preferred to 0 when A is risk averse.

Where the agent is risk neutral, however, and the principal is risk averse it is no longer necessary to sacrifice risk-sharing benefits to achieve incen­tives. Incentives and risk sharing are compatible, as can be seen from Figure 5.6, and the agent will optimally bear the entire risk. The structure of the figure is the same as that of Figure 5.5. Once more contracts to the right of rr’ provide an incentive to A to exert effort level e. The set of con­tracts Pareto preferred to 0 is given by the shaded area. In this case, however, the boundary xw no longer represents the set of Pareto-efficient contracts. At w, for example, the agent could be made better off without harming the principal by a move to point ^, thereby achieving a more efficient distribution of risk. Points along P’s certainty line share risk efficiently and induce effort e from the agent. ‘Pareto efficient contracts’ are ‘first best’ even under conditions of unobservability. Information on the agent’s effort or on the state of the world has no value to the principal. At the agent pays a fixed fee to the principal and is entitled to keep what­ever remains of the harvest.

5. INFORMATION

A risk-averse agent and a risk-neutral principal (Figure 5.5) will have to sacrifice risk-sharing benefits if effort is to be induced under conditions of ‘unobservability’. The ability to observe the agent’s effort is therefore valu­able because it enables risk-sharing benefits to be captured. Information about the agent’s effort may be subject to error, however, and it is not imme­diately obvious whether principal and agent would both agree to use this kind of information in their contractual arrangements. As reported in section 2, abstracting from the costs of writing and enforcing an increas­ingly complex contract, an informative signal, no matter how noisy, can be used to increase the utility of both parties. In this section we illustrate this proposition using the simple example of principal and agent discussed in section 4.

Suppose to begin with that no information about the agent’s effort is used in the contract. Efficient contracts will then lie along the locus rr’ in Figure 5.5. Consider the efficient contract at x. As argued at length earlier, the agent at x will just be indifferent between effort and no effort. By exerting effort the agent faces one gamble and by not exerting effort he faces another (different) gamble. He is just indifferent between these two gambles. Specifically, if the agent remains idle he faces the following gamble:

where in this case π*e1A represents the outcome π*1A bushels having exerted effort level e. From our diagram we know that for person A, GIGe, where I represents indifference.

Now let the principal (landlord) monitor the agent (labourer) through a series of spot checks mentioned in section 2. He may visit the field at random a given number of times. If the labourer is never there, the land­lord will conclude (perhaps wrongly) that his effort level has been zero. Providing the labourer is there at least once, the landlord will conclude (again perhaps wrongly) that effort level e has been forthcoming. Satisfying this crude spot-check test will yield a reward of + Se and not satisfying it will incur a penalty of – 8o. Might such an arrangement be agreeable to both principal and agent?

Ignore for the present the problem of how the labourer is to know whether the landlord is telling the truth when the latter claims to have seen neither hide nor hair of the former. This is an important issue which will be taken up later in Chapter 6 on hierarchies. Assuming that the landlord monitors in the way described, the labourer will be able to calculate the probability that he is observed hard at work when in fact he has been idle, as well as the probability that he is observed to be idle when in fact he has been hard at work and so forth. Let the first subscript represent the land­lord’s observation of effort and the second subscript the tenant’s actual effort. Thus let

qoe = probability that landlord observes zero effort when actual effort is e;

qee = probability that landlord observes effort e when actual effort is e;

qeo = probability that landlord observes effort e when actual effort is zero;

qoo = probability that landlord observes zero effort when actual effort is zero.

With these probabilities, we can construct a number of new gambles which we might call ‘monitoring gambles’. Consider, for example, the following prospects:

These are the gambles the labourer would face if his effort level were zero. They depend upon the harvest. If the harvest turned out to be good and the labourer had agreed to be monitored, he would then face gamble miA. Similarly, if the harvest were bad, the labourer would then face gamble m2A. By agreeing to be monitored, therefore, the idle labourer would change the original gamble G into the complex gamble Gm where

Suppose now that qoo > qeo; that is, the landlord is more likely to observe ‘correctly’ than ‘incorrectly’. If the labourer is idle, it is more likely that he will be observed as such than that he will be observed exerting effort e. This is our minimal requirement for the landlord’s monitoring to be ‘informa­tive’. If qoo = qeo, the spot checks would produce a signal that was all ‘noise’ and no information. If qoo< qeo, the signal would be positively misleading. Further, assume that 8e < 8o.

Clearly, if qoo > qeo, and 8o >Se, gambles miA and m2A are statistically ‘unfair’. The labourer would prefer ^*A to the gamble miA, and rn*LA to the gamble m2A, assuming that he is risk averse. Thus, we deduce that the idle labourer will not want to be monitored and that he will prefer G to Gm.

For the industrious labourer, the situation is rather different, however. He will face the ‘monitoring gambles’:

If he consents to be monitored, the original gamble Ge will change to the more complex gamble

Again, suppose that the monitoring of the landlord is ‘informative’ so that qee >qoe and that the industrious labourer is more likely to be seen as indus­trious than idle. Further, assume that the reward 8e and the penalty 8o are set such that 8e<8o as above, but also qee8e = qoe8o. Gambles me1A and me2A will then both be ‘fair’ gambles.

As was seen in section 3, a risk-averse labourer will reject a fair bet. However, we also saw in section 3 that as the ‘stakes’ were reduced a risk­averse person would approach indifference between taking and not taking a fair bet. People are risk neutral ‘in the limit’. Thus, we might imagine 8e and 8o being reduced in size whilst always maintaining the ratio 8e/8o = qoe/qee. In the limit, the industrious labourer will be indifferent between Gme and Ge. Because the ‘monitoring gambles’ m1A and m2A are statistically unfavourable, the idle labourer, on the other hand, will always strictly prefer G to Gm and can never be brought to indifference no matter how tiny the stakes.

At point x, therefore, we have the following results as 8e,8o tend to zero with 8e /So = qoe /qee

Thus, GmePGm. Without the monitor, the labourer at x is just indifferent between effort and no effort. With the monitor and an ‘informative signal’, we can construct a situation in which the labourer strictly prefers effort to no effort at x and is no worse off in his own estimation than he was in the absence of the monitor (GmeIGeIG).

Consider now how the principal is affected by these arrangements. We continue to assume that the signal is costlessly observed. Providing that the monitoring gambles me1A and me2A are ‘fair’, the risk-neutral principal will be prepared to offer them to the agent whatever the absolute sizes of 8e and 8o. Even if the principal were risk averse, we could apply the same argument used above for the agent to show that he would approach indifference as the stakes declined. Thus, in the limit, neither principal nor agent will be any worse off from a risk-bearing point of view as a result of the ‘monitoring gambles’. Work incentives, however, have changed, as we have seen, and the benefits from this enhanced work incentive can be used to increase the utility index of either the principal or agent.

The argument is illustrated diagrammatically in Figure 5.7. The locus rr’ represents, as before, points at which the agent is indifferent between effort and no effort, when no information about effort is used in the con­tract. Monitoring, as described in the earlier paragraphs of this section, using noisy but informative signals, results in effort being strictly preferred by the agent at x. It will therefore be possible to find another point such as y at which the agent is once more indifferent between effort and no effort even in the context of monitoring.8 The locus of all such points might be represented by the curve mm’. At y, the utility of the industrious labourer will be the same as at x, but the utility of the landlord will be greater than at x. Monitoring, by providing a source of additional effort incentives, enables risk-sharing benefits to be achieved without reducing the amount of effort forthcoming. In effect, monitoring is being used as a substitute for a more inefficient distribution of risk as a means of induc­ing effort.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Examples of incentive contracts in the firm

The principles which have been outlined in the first five sections of this chapter have applications which are more wide ranging than the share­cropping case which we have thus far been using for illustrative purposes. Harris and Raviv (1978) provide a number of interesting examples.

1. Health and Motor Insurance

Moral hazard in insurance markets is a classic problem, as was seen in Chapter 2. In terms of the content of Chapter 5, however, what kind of insurance contracts are likely to be observed and in what situations? Let θ, the state of the world, stand for ‘degree of illness’ instead of weather con­ditions. Let e stand for ‘health effort’ (for example, not smoking, taking safety precautions in dangerous tasks and so forth). Finally, let ^ represent ‘amount of healthcare used’; that is, the outcome.

A contract which depends on the outcome alone (that is, simply on the amount of healthcare someone consumes) will, according to the earlier dis­cussion, be inefficient relative to a contract which depends on both the outcome and the state if the latter is observable. Thus, if ‘degree of illness’ can be observed, we would expect contracts to specify payments condi­tional upon the degree of illness, just as payments to the labourer in the earlier example depended if possible on the weather and not merely the harvest. Assuming ‘degree of illness’ is unobservable, information about ‘health effort’ should be valuable. If health effort is observable, again we would expect this to be reflected in contracts; for example, lower premiums for non-smokers, higher premiums for those who refuse to wear seat belts in cars, and so on.

In the case of motor insurance, θ could represent ‘difficulty of driving conditions’, e ‘safety effort’, and π ‘damage to the insured and to third parties’. Clearly, a contract which depends only on damage will be inferior to one based on both π and θ if the latter is observable. Thus, we might expect insurance contracts to vary by geographical area if ‘driving condi­tions’ differ. If θ is not observable, information on safety effort will be valu­able. This too may be very difficult to observe, but if driving care is perfectly correlated with age, for example, we would expect to see insurance con­tracts varying with the age of the insured.

2. Law Enforcement

Here the principal is seen as the political representative of the general public, while the agent is the policeman whose task it is to detect and punish crime. Suppose the outcome ^ in this case is revenue generated by fines. Effort e will be ‘policing effort’ such as patrols, observations and whatever other activities compatible with the law raise the total of fines collected. 0 represents the ‘state of crime’. It reflects the type and seriousness of crimes committed and, to fit into the principal-agent framework of this chapter, it is assumed that this ‘state of crime’ is independent of policing effort e.

Applying the sharecropping results to this new situation, an efficient contract would be expected to involve both the outcome, ‘fines generated’, and the ‘state of crime’. The government would receive a payment from the police which depended only on the state of crime and the police would keep the residual fines. Risk could then be distributed in accordance with the principles of section 3. If the government were risk neutral and the police were risk averse, the residual fines in each ‘state of crime’, given that the standard policing effort had been applied, would ideally be constant. On the other hand, if the police were risk neutral, no information about effort or state would be necessary for the efficient contract. The police would pay a flat fee to the government independent of the state of crime and would keep whatever revenue in excess of this fee they succeeded in generating. Effectively, the police would be paying a certain sum for the ‘policing fran­chise’ of an area and would be rewarded by their success in levying fines. Unsettling though such arrangements undoubtedly appear, there seems little doubt that they are well designed to produce a dedicated police force (dedicated, that is, to collecting fines). A more detailed discussion of ‘fran­chising’ as an incentive device and form of business organisation occurs in Chapter 7, while Chapter 15 contains a discussion of the role of franchis­ing in government regulation.

Assuming that the state of crime is unobservable, information on effort is valuable if the police are risk averse. Even if the information is ‘ noisy’, it may be used to improve incentives, as we have seen in section 5. A police­man’s contract might, for example, be designed so that effectively he ‘posted a bond’ which would be returned to him providing that malfeasance remained undetected over a specified period. Corruption or negligence would result, if observed, in the loss of the value of the bond. In other words, loss of the bond would be equivalent to the term —8θ which appears in the monitoring gambles of section 5.

3. Employment Contracts

Consider first a case in which the employee’s output is observable, whilst effort and state are unobservable. This case is identical with the sharecrop­ping example discussed in section 4. The employee must take some risk if effort is to be induced, and this will involve inefficient risk sharing unless the employee is risk neutral. In the latter case, we once more observe the ‘franchising’ solution, with the ‘employee’ paying a fee to the ‘employer’ for permission to use specified resources for a given period of time.

Where employees are risk averse, a contract involving both outcome and state will be preferred to a contract involving just the outcome. The state variable 0 might represent ‘market conditions’, in which case, if these were ‘observable’, contracts might be expected to link remuneration to some economic indicators of these conditions. Employees could be envisaged to pay a fee to the employer conditional only upon the state. This fee would presumably be lower in times of depression and higher in times of pros­perity if workers were risk averse. The employer effectively offers insurance to the workers and takes the brunt of economic fluctuations. In the extreme case of a perfectly observable state variable θ, and a known function π = π (θ, e), the worker’s remuneration would be constant in both prosperity and depression. Although practical examples of contracts explicitly written in this way are not easy to find, it has been argued that many contracts of employment are implicit.9 The employer accepts an implicit unwritten obligation to insure the workforce against fluctuations in the way described. It is suggested that this implicit obligation helps to explain the ‘stickiness’ or ‘inflexibility’ of wage rates over the business cycle. Along with supple­mentary assumptions about the type of social security system operating, or the nature of information about θ (it may be asymmetrically distributed and available to employers but not employees), the implicit contracts liter­ature has attempted to explain the existence of levels of employment greater than or less than would be observed under symmetric information. We shall discuss some of the ideas of the implicit contracts literature in a little more detail in Chapter 6.

Another application of the principal-agent results to employment con­tracts concerns the use of educational qualifications. Suppose that output depended not only on effort e but also on ‘native ability’ (Harris and Raviv, 1978). The state variable 0 would now refer to the ‘native ability’ of the worker. Clearly this is not easy to observe, but if employers believed that ‘native ability’ was reflected in educational attainment we would expect remuneration to depend on qualifications and not just on the outcome. According to this view of things, education does not necessarily equip people with specific skills, but provides them through a series of tests, no matter how pointless in other respects, with evidence about their ‘native ability’. Education is a ‘screening procedure’10 which tests for a particular type of attribute. This evidence can then be used by employers when con­tracting with employees.

Where the state variable is unobservable and workers are risk averse, information about effort is valuable. Perfect observability of the employees’ effort by the employer would simply result, of course, in a contract depen­dent on effort alone. The risk-averse employee would again be assured of the outcome, and the risk-neutral employer would take the risk. This, it will be recalled, is the principal and agent result considered in Chapter 4, where the ‘single proprietorship’ was discussed. There, however, it was assumed that monitoring was costly and there was no presumption that effort was perfectly observable. It is worthwhile remembering, therefore, why it was that, even with costly monitoring, the contract did not involve the outcome and depended on effort alone. Conditions of team production implied that output could not be ascribed to particular members of the team, and the incentive effects of linking individual rewards to the collective outcome are minimal for large teams. In terms of our discussion of this chapter, there­fore, contracts dependent upon the outcome will not induce effort under conditions of team production, because the probability of a preferred outcome is perceived to be only very weakly related to individual effort. A large individual effort (distance 0a in Figure 5.4) will not greatly affect the slope of the indifference curves so that the set of contracts xyw in Figure 5.5 disappears and there are no Pareto improvements on θ. From an indi­vidual point of view, the ‘efficiency of effort’ is small, even though jointly it may be very high.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Monitoring the effort of team members in the firrm

In the light of the above, the individual employer under conditions of team production must therefore rely on monitoring to induce effort. We have already surmised in Chapter 4 that the proprietorship will be viable only if the returns to monitoring are ‘sufficiently great’. Again, to reinterpret this observation in the light of the analysis of Chapter 5 may be useful. Assume, as above, that there are only two effort levels, zero and e, which employees can choose. Assume further that if everyone chooses effort e the probabil­ity of outcome π will increase top1efrompy, otherwise the probability of will remain at pj.11 Figure 5.5 can be reinterpreted to apply to the employer P and the ‘typical employee’ A, with indifference curves UAe now reflecting the utility index of the employee when the whole team is exerting effort e. If the employer could be absolutely certain of every employee’s effort simply by monitoring at a given level of intensity, the team would obviously be viable, providing the potential benefits, distance at, multiplied by the number of employees, exceeded the costs to the employer of the crit­ical level of monitoring required. Each employee would have a contract at a and would be monitored sufficiently intensely to determine with certainty that effort e was forthcoming. Providing the average monitoring cost per employee fell short of at, both employer and employee are capable of becoming better off than remaining at 0.

A natural extension of this analysis is to consider what would happen if the employer could observe the employee’s effort only with some accom­panying error. In section 5 it was shown how even a very ‘noisy’ signal could, in principle, improve the positions of both principal and agent.

However, in that section we were considering a case in which there existed a contract x involving the outcome alone which was Pareto preferred to the original position at θ in Figure 5.5. The addition of any informative, though noisy, signal could be used to improve further on such a contract. In the case here, however, we have assumed that no contract based on the outcome alone is viable, and our point of departure is from θ. Clearly, from this inauspicious starting point the principal’s information about the agent’s effort will have to be reasonably good if it is going to make much difference. Some signals may simply not be ‘informative enough’ to be capable of pro­viding opportunities for mutual benefit. On the other hand, intuition sug­gests that perfect information may not be necessary, and that a ‘sufficiently informative’ signal, which was nevertheless subject to error, might be useful.

That such a possibility exists can be seen by considering once more the monitoring gambles of section 5. There we saw that the labourer could, providing the monitor was more likely to observe ‘correctly’ than ‘incor­rectly’, be presented with a reward structure which implied that an idle labourer would be taking an ‘unfavourable’ gamble while an industrious labourer would be taking a ‘fair’ one. Now, unlike the labourer at point x in section 5, this will not be sufficient, in itself, to ensure that the labourer at a point such as a will prefer effort to no effort. The idle labourer at a will not like having to take an unfavourable gamble if he is monitored, but he does not like exerting effort e either, and the ‘effort price’ of avoiding the unfavourable gamble may be too high for him. (By contrast it will be recalled that at point x in the example of section 5, the labourer was indifferent between effort and no effort, so that there was no ‘effort price’ of avoiding the unfavourable gamble.)

Although the prospect of any unfavourable gamble will not automati­cally induce effort at point a, it is clearly possible that the prospect of an extremely unfavourable gamble might do the trick. At point a, the employee or agent would face the ‘monitoring gamble’:

assuming that effort zero were chosen. If the principal’s information is very good, so that both qeo (that is, the probability of being observed working when the agent is in fact idle) and qoe (the probability of being observed shirking when the agent is really working) are very small, the gamble mAa will be extremely unfavourable. Thus, the more reliable the information of the principal, the more adverse is the monitoring gamble taken by the shirker. It is at least possible to envisage a point at which work becomes more attractive than the monitoring gamble associated with idleness.

We cannot appeal to the limit theorem used in section 5 to prove this proposition, since it is clear that the ‘stakes’ cannot be infinitesimally small if the jump in effort from zero to e is to be forthcoming. The employee, or agent, will have to face values of Se and S0 sufficiently large to induce effort e whilst the value of π must be chosen so as to leave him as well off as he was at a. It is not, therefore, sufficient for the industrious labourer to be offered a ‘fair’ monitoring gamble at a, since risk-averse people will be made worse off by non-infinitesimal fair bets. The monitoring gamble will have to be favourable at a, or, if it is fair, the contract point must be to the right of a along A’s certainty line. In the latter case, providing a point exists to the left of t (say point p) where the agent is as well off exerting effort and taking a fair monitoring gamble as he was at θ, and providing the moni­toring gambles are so constructed that effort is preferred to idleness at p, the use of the principal’s less than completely reliable information may permit Pareto improvements on θ. Distance ap in Figure 5.5 represents the cost to the risk-averse agent of taking the fair monitoring bet. Clearly, other things constant, the bigger are S0 and Se the bigger will be the distance ap, but the more reliable the principal’s information the more adverse is the monitoring gamble of the shirker and the smaller can be the value of S0 and Se compatible with making effort the preferred option on the part of the agent. Thus, more reliable information will be associated with a smaller dis­tance ap.

Figure 5.8 may help to illustrate the relationship between effort costs θa, risk-bearing costs ap, and the monitoring gambles. At point p, the agent receives ^ with certainty. This gives no incentive to exert effort. Now offer the agent a monitoring gamble, π + Se, conditional upon being observed working, and π – S0, conditional upon being observed idle. Further, ensure that this monitoring gamble is ‘fair’ for the industrious worker so that QeeK = %eh with Vee > ?fl*. Call this monitoring gamble point mp. Clearly, the agent will be worse off at mp than at p. The cost of bearing the uncertainty of the fair monitoring gamble is ap. For the idle labourer, however, fair gambles occur along the line through p slope -q00/q0e and hence the gamble at mp is clearly unfavourable. Draw the idle labourer’s indifference curve through mp and let it cut the certainty line at θ. If dis­tance θa is the cost to the agent of exerting effort level e, the agent will be indifferent at mp between effort and no effort; mp will represent the least costly monitoring gamble which is ‘fair’ to the industrious labourer and is just able to induce effort e.

The effect of more reliable information can now be deduced from the figure. An increase in qoo/qeo (that is, better information about the shirker) will steepen A’s indifference curve through θ. It follows that, at mp, A’s utility index will be lower than before if he shirks, and he will strictly prefer effort to no effort. Thus, a less costly monitoring gamble between mp and p can be found. Even perfect information about the shirker will not remove risk-bearing costs entirely, however, if there is a chance of perceiving an industrious labourer as idle. Zero risk-bearing costs would require perfect information about the industrious worker; that is, qoe/qee = 0.

If we now introduce the complication that reliability of information may depend upon the monitoring costs incurred by the principal, we see that the distance pt in Figure 5.5 will exaggerate the gains to be had from monitor­ing by the amount of these costs. This suggests that there may be some efficient amount of monitoring effort on the part of the principal. At very low levels of monitoring effort, the information may be so unreliable that p lies to the right of point t and no Pareto improvement on θ is possible. At very high levels of monitoring effort, point p may lie close to point a. The risk-bearing costs associated with the monitoring gamble are low because information is very reliable, but in this case the monitoring costs may be so big that they more than absorb the potential benefits pt, and there is still no Pareto improvement on θ. If monitoring is to be viable, there must be a point at which the distance pt minus total monitoring costs is positive, and the efficient amount of monitoring effort will be that at which distance pt minus costs of monitoring is maximised. This gives us a new perspective on Figure 4.2 in Chapter 4. There we argued that monitoring effort would be applied to the point at which marginal benefits and costs were equal. Here we are looking behind the MB1 schedule. The benefit of additional moni­toring in this framework is more reliable information. A given amount of effort e can thus be induced from each employee using monitoring gambles involving lower risk-bearing costs.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.

Incentive contracts and the firm

In this chapter our primary task has been to consider in some detail the contractual problems which face two parties when outcomes are uncertain, and when information cannot be observed or is costly to observe and subject to error. It is appropriate at this point to discuss explicitly the significance for the theory of the firm of the issues covered in the first six sections. Sharecropping, franchising, insurance contracts, police incentives, and even some of the employment contracts discussed earlier, may at first appear rather specialised and peripheral concerns rather than of central importance. Yet, as has been emphasised from the very outset, the firm is a particular type of contractual environment, and its characteristics would be expected therefore to be moulded by the sorts of considerations which formed the basis of our discussion above. In Chapter 1, the firm was ratio­nalised as a device for coping with uncertainty and the passage of time; in Chapter 2 it was seen as a response to ‘opportunistic behaviour’ resulting from ‘information asymmetry’, in Chapter 4 the firm evolved to permit policing and monitoring of cooperating inputs in conditions of ‘team pro­duction’. All these ideas are closely interrelated. They all reduce ultimately to the firm as an institutional consequence of imperfect information, and they all have in common the idea of the firm as a nexus of contracts with a central agent.

Within the nexus of contracts called the firm, however, there is scope for considerable variety. The nature of the contracts is not absolutely standard and will vary as conditions vary. As we saw in Chapter 2, Coase (1937) stressed the direction of resources and the employment or ‘authority’ rela­tion as characterising the typical contract. No doubt there are particular cases in which this is descriptively not unrealistic, but labour is not the only type of input used by the firm and, as pointed out at the end of Chapter 2, positive monitoring costs associated with less than perfect ‘observability’ (Williamson would say ‘information asymmetry’) will usually mean that employer-employee relationships have attributes similar to those of prin­cipal and agent. The sections above have indicated, however, that there are many possible ‘solutions’ to the principal-agent problem, depending on assumptions about what is and what is not observable, the risk preferences of the parties, the costs of monitoring and so forth.

In some firms, people will be closely monitored, whereas in others they may have wider discretion. Some will be paid according to their own par­ticular output (for example, by piece rates), others in ‘team’ environments (in the sense of Alchian and Demsetz) may be paid according to their effort. Payment according to effort will involve ‘monitoring gambles’ as described above, and the nature of these may also vary between firms. If monitoring of effort is very efficient and reliable, the risk involved in being monitored may be small, perhaps involving small bonuses or other prizes. Where mon­itoring is less reliable, the gamble may be substantial, as when promotion to a higher grade involves a considerable pay rise. Thus, the structure of a hierarchy can be seen as being closely related to the provision of incentives through what we have called ‘monitoring gambles’ and this is a topic which will be taken further in Chapter 6.

It is important to remember also that within a single firm different people will have different types of contract. A senior manager whose performance is difficult to monitor by shareholders may have a contract which links remu­neration to the outcome through stock options or bonuses linked to profits. Similarly, a travelling salesman for the company may depend heavily on his particular sales record, the difficulty of monitoring implying that the sales­man has to tolerate considerable risks. The professional designers of the product, on the other hand, are also difficult to monitor, but because indi­vidual ‘output’ may be impossible to measure, ‘effort’ may be monitored and incentives given by means of promotion through a hierarchy. At least a design team is likely to be concentrated in a particular geographical loca­tion, and members may thus be assessed by more senior leaders of the team. The people who actually fabricate the product could have quite different contracts again, depending on the technological context and its implications for monitoring. To take just one example, a simple production-line process involving little chance for ‘shirking’, with the line speed predetermined by the manager, could result in a standard payment per ‘shift’. Providing the number of ‘shifts’ worked per week does not vary, the employee would be relieved of all risk. For a risk-averse employee, this would be preferred to a contract dependent upon output, since output would vary with technical problems such as ‘breakdowns’ or poor components. Where, however, as is almost invariably the case in practice, the chance of a breakdown or the quality of the final product depends to some degree on the alertness, con­centration, or dexterity of the employee, incentives via the possibility of promotion may still be used even on production lines.

Where monitoring is very costly, and effort and state of the world are effectively ‘unobservable’, contracts will depend upon the outcome alone. A particular example of this arrangement is the franchise contract, and we have discussed some of its properties in an earlier section. In the present context, however, the franchise contract warrants closer inspection because it represents a relationship which leaves the franchisee with a large area of discretion as to how affairs should be conducted. It is thus far distant from the ‘authority relation’ discussed by Coase, or the exercise of ‘conscious power’ mentioned by Robertson. From the point of view of economic prin­ciple, however, a franchise chain would seem to have the major character­istic required to make it a single ‘firm’. A franchise chain is a ‘nexus of contracts’ in which each franchisee has a contract with a single contractual agent or franchisor. Legally, these contractual arrangements exist between separate ‘firms’, yet economically the transactions might be regarded as taking place within a single firm. This is the essence of a contribution by Rubin (1978, p. 225) which emphasises the somewhat arbitrary distinction between interfirm and intrafirm transactions and argues that ‘the economic concept of a “firm” does not have clear boundaries’.12

The theoretical developments reported in this chapter are in the neo­classical ‘contracts’ tradition. There are several features of firms which need to be borne in mind when interpreting the results.

  1. Firms will usually be contracting with several agents (whether workers or suppliers) and if all agents face similar environmental conditions (even if these are not observable by the principal) relative performance may be used to assess an agent’s effort. Providing the agents do not con­spire against the principal, information on relative performance can be used in contracting. The firm, in other words, runs a tournament. Further analysis of the tournament as an incentive device is provided in Chapter 6.
  2. In a market setting, repeat dealing may be important. Similarly, within a firm contracts continue over time and are not isolated events. This means that information on an agent’s past performance can be used in future contracts. Shirking can result in a process of settling up ex post, and agents will have to consider the implications of a poor outcome for the terms of future agreements.
  3. If agents differ from one another in their level of skill, the problem of choosing the agent and adapting the contract to his or her particular attributes becomes important. In the case of risk-neutral agents, the choice might be made by auctioning a contract to the highest bidder. Some of the contractual difficulties with this approach are considered in Chapters 11 and 15 in the context of the contracting-out of govern­ment services. If competitive bidding is not suitable, screening devices may, in some circumstances, be used to separate agents of differing quality. Chapter 6 explains the principles of screening mechanisms in more detail.
  4. The agency contracts investigated in the earlier sections of this chapter were complete. Because there were only two possible outcomes, two possible effort levels, and a single period of time, the contract could be specified relatively simply. Contracts within the firm, as emphasised in earlier chapters, are incompletely specified as a result of bounded ratio­nality. In this respect, principal-agent theory does not take into account one of the major elements of the nexus of contracts concep­tion of the firm.

Source: Ricketts Martin (2002), The Economics of Business Enterprise: An Introduction to Economic Organisation and the Theory of the Firm, Edward Elgar Pub; 3rd edition.