This eBook is for the use of anyone anywhere at no cost and with almost no restrictions whatsoever. You may copy it, give it away or re-use it under the terms of the Project Gutenberg License included with this eBook or online at www.gutenberg.net ** This is a COPYRIGHTED Project Gutenberg eBook, Details Below ** ** Please follow the copyright guidelines in this file. **
Title: Definition & Reality in the General Theory of Political Economy
Author: Thomas Colignatus
Release Date: May 30, 2005 [eBook #15939]
[Most recently updated: June 12, 2005]
Language: English
Character set encoding: windows-1252
***START OF THE PROJECT GUTENBERG EBOOK DEFINITION & REALITY IN THE GENERAL THEORY OF POLITICAL ECONOMY***
Copyright (C) 2005 by Thomas H.A.M. Cool
Thomas Colignatus
Dutch University Press
&
Samuel van Houten Genootschap

This text is made available by the copyright owner as an etext for the Project Gutenberg (see http://www.gutenberg.org). The Project Gutenberg header specifies under what general conditions the book may be distributed while associated with the Project Gutenberg trademark. Additional conditions are: (a) No commercial use or resale of the etext is allowed, (b) Hardcopy prints are to be acquired at the publisher Dutch University Press (see http://www.rozenbergps.com and ISBN 90-3619-172-6) or their designated partners. Please note that the etext uses HTML so that the etext layout will differ significantly from the official layout.

2nd edition,
January 2005
(The first edition was in March & June 2000)
Copyright © Thomas H.A.M.
Cool
http://www.dataweb.nl/~cool, cool@dataweb.nl
Colignatus is the preferred name of Thomas Cool in science.
Lawful exceptions excluded, nothing of this publication may be copied or published by means of print, photocopy, microfilm, electronics or otherwise, unless a written permit has been given by the lawful owners of the copyright, and this also holds for whole or partial reworkings.
Behoudens uitzondering door de wet gesteld mag zonder schriftelijke toestemming van de rechthebbende(n) op het auteursrecht niets uit deze uitgave worden verveelvoudigd en/of openbaar gemaakt door middel van druk, fotokopie, microfilm, electronica of anderszins, hetgeen ook van toepassing is op gehele of gedeeltelijke bewerking.
|
Published by
Dutch University Press
|
Supported by
Samuel van Houten Genootschap Wetenschappelijk bureau van
het Sociaal Liberaal Forum |
ISBN 90-3619-172-6
JEL A00
Nederlandse uniforme genre indeling (NUGI) 681, 654 & 691
CIP-GEGEVENS KONINKLIJKE BIBLIOTHEEK ’S-GRAVENHAGE
Prologue
The basic idea of this book is that Keynes’s General Theory is generalised even further by including endogenous government in the model, so that we arrive at a truly general Political Economy. The world had the Great Depression 1930-1940 and has the Great Stagflation 1970-today and by including ‘stagnation in economic policy making’ in our analysis we find a better explanation. The general theory also advises a democracy to create an Economic Supreme Court as a separate constitutional power, next to the Legislative, Executive and Judicial branches.
This book is primarily directed at my fellow economists and it primarily gives theory and stylized facts. The colleagues will specifically have to understand the ‘Definition & Reality methodology’ before they will appreciate that my analysis is scientifically warranted. Much work remains to be done in practical research. And much work remains to be done by the other professions.
Since the current imbalance of powers has many victims, it may be hoped, none the less, that the parliaments of our democratic nations investigate the issue too, so that there is more hope for improvement in their living conditions. Parliaments should do as Alfred Marshall (1890, 1947:3) wrote:
“Now at least we are setting ourselves seriously to inquire whether it is necessary that there should be any so-called ‘lower-classes’ at all: that is, whether there need be large numbers of people doomed from their birth to hard work in order to provide for others the requisites of a refined and cultured life; while they themselves are prevented by their poverty and toil from having any share or part in that life.”

Books are more stimulating and more enjoyable to read if they are guided by questions and if they cause questions themselves. This book has been written in the style that it provides answers and thus it must be feared to be a dull read. It is too late to change that style. However, some questions are: (1) How is it possible that Europe has an unemployment of about 10% for more than three decades now, and the USA the mirror image of poverty ? (2) Can we really trust our governments ?

With this book ends a project that basically started with the Fall of the Berlin Wall in 1989. My hope is that this book contributes to the fall of some other walls, i.e. the intangible mental ones, consisting of perceptions and conventions - but equally confining.
Contents in Brief
Book I Introduction 11
Book II Trias Politica and Economic Supreme Court 16
Book III Economics ‘as usual’ 36
Book IV Presentations for the general public 60
Book V Methodology: Definition & Reality 69
Book VI Structural models 87
Book VII Social Choice 158
Book VIII Supportive notions 186
Book IX Reduced form 198
Book X Conclusions 216
Appendices 266
The symbol ° is used to indicate market clearing equilibrium (and possibly expectational). The symbol * or E[.]is used for expectations and expectational equilibrium (and possibly market clearing). The symbol °* is used for both, and · for the one or the other (and possibly both).
Contents
Book I Introduction 11
1. Order of presentation 11
2. The general theory 11
3. Methodology 14
Book II Trias Politica and Economic Supreme Court 16
4. The Trias Politica 16
5. The economic record of the 20th century 18
6. An Economic Supreme Court 24
7. Position of the Court in economic theory 26
8. The record of economics itself 26
9. Economics ‘as usual’ and its inadequacy 30
10. Four empirical cases 32
11. The moral imperative 33
Book III Economics ‘as usual’ 36
12. Introduction 36
Stylized history 38
Structure of the argument 41
The difference that it means 42
13. Unemployment via taxes and minimum wage 43
The earnings distribution 44
Analysing the minimum wage 44
The Tax Void 47
Cause of the Tax Void 48
Development of the Tax Void 51
Marginal tax rate & VAT 53
Marginal tax rate & dynamics 54
Spillover and domino effects 56
Diagnosis and Therapy 56
Stagflation resolved 57
14. The 1974 Duisenberg disaster 59
Book IV Presentations for the general public 60
15. Unemployment solved ! 60
16. Enable Russia to help itself 64
Parallel 64
Risk not chance 65
Internal not external 65
Conclusion 66
17. Will the West repeat Versailles ? 66
Book V Methodology: Definition & Reality 69
18. How to check ? 69
19. Dealing economically with concepts 70
Maximising information power 70
Pythagoras and the circle 73
Falsification 76
Determinism and free will 78
From stylized fact to definition 82
Relating to Hicks 1983 83
20. Structural and reduced form 84
21. Direct application to the Economic Supreme Court 85
22. Methodological summary 85
Book VI Structural models 87
23. A textbook macro-economic model 87
The IS-LM model 87
The production function 89
Dynamics versus statics 90
Phillipscurve 90
Macro-economic interactions 91
24. Heterogeneity and nonlinear taxation 92
Heterogeneity versus homogeneity 92
Nonlinear versus proportional taxation 93
Some literature 93
25. Summary of current views 94
A simple view 94
A complex view 96
Efficiency wages intermezzo 96
A more sophisticated view 97
Confusions 98
26. Heterogeneous labour 99
Dromedary supply 99
Dutch income distribution data 100
Definitions and formulas 102
Amendment to the textbook model on the Phillipscurve 106
27. Subsistence 106
Definitions 107
Economic literature 109
Types of indexation 109
Formal development 110
28. Phillipscurve 115
Concepts 115
A homogeneous Phillipscurve 118
On expectations 121
Heterogeneous Phillipscurves 122
More factors that cause a shift 122
Crowding out 123
Poverty 124
The submarket Phillipscurves 125
Shifting back 125
29. Tax basics 126
Taxes and premiums 126
Common structure 127
Nonlinear tax function 128
Exemption 129
The marginal rate 140
Balanced growth 143
Off balanced growth 144
30. Dynamic curvature of the tax wedge 145
Introduction 145
Formulas 145
Graphs 147
31. Differential impact of the minimum wage on exposed and sheltered sectors 149
Introduction 149
Model 151
Graphs 152
Tables 154
Conclusion 155
32. Dynamic optimality 155
The Phillipscurve revisited 155
Investment, growth and productivity 156
Book VII Social Choice 158
33. Introduction 158
34. The solution to Arrow’s difficulty in social choice 159
Introduction 159
Basic concepts 162
Restatement of Arrow’s Theorem 165
A note on the name of APDM 167
A lemma 167
Rejection of the Arrow Moral Claim (AMC) 168
Rejection of the Arrow Reasonableness Claim (ARC) 168
Selection of the culprit axiom. 169
Examples of consistent constitutions 170
A reappraisal of the literature 170
Conclusion 172
Addendum: Sen’s restatement in “Development as freedom” 172
Addendum: Mas-colell, Whinston and Green, “Microeconomic Theory” 175
35. Without time, no morality 175
Introduction 175
Control of natural forces in the social process 176
Three traditional methods 177
Borda Fixed point 178
Relation to Saari’s work 179
Pareto 182
A note on cheating 182
Conclusion 183
36. Some notes on ethics 183
Book VIII Supportive notions 186
37. On the nature and significance of a free lunch 186
Some quotes 186
Consumers surplus 187
Economic growth 188
Conclusion 192
38. Proper definitions for uncertainty and risk 192
Uncertainty 192
Risk 193
Example 195
Wrong use in economics 1921-2005 196
Book IX Reduced form 198
39. The possibility of full employment in the welfare state 198
Introduction 198
Stylized facts 198
Concepts 199
The theorem 201
Graphical presentation 205
40. The possibility of co-ordination 206
Stylized facts 206
Concepts 207
The special theorem 211
The general theorem 213
On the interaction of the reduced form theorems 214
More on chance 215
Book X Conclusions 216
41. Relating to Mankiw’s “Principles” 216
42. Relating to Krugman, Phelps, Ormerod and Heilbroner & Milberg 219
Introduction 220
Review of positions and qualities 220
Krugman: “We don’t know” 222
Phelps: “Structural slumps” 224
Ormerod: “Death of economics” 228
H&M: “Crisis of vision” 230
All authors 232
43. Relating to Sen, Galbraith and Cox & Alm 232
Sen: “Development as freedom” 232
Galbraith: “Created Unequal” 235
Cox & Alm: “Myths of rich and poor” 242
44. Relating to the OECD and some of its authors 246
The OECD in general 246
The EITC, direct payroll tax reduction and wage cost subsidies 247
45. After 35 years of mass unemployment: An advice to boycott Holland 250
Summary 250
Introduction 251
First considerations 251
The realism of my advice 254
George W. Bush and Iraq and the American economy 254
More on Paul Krugman 256
The Dutch tragedy of the murder of Pim Fortuyn in 2002 256
On the European Enlargement 259
Advice to vote NO on the current proposals for a European Constitution 260
A note on my own position 261
Appendix: After 20 years of mass unemployment: Why we might wish for a parliamentary inquiry 262
46. Final conclusion 263
Epilogue 264
Appendices 266
On the definition of economics 266
Biographical note on Montesquieu 270
Price inflation and wage growth in Holland 1950-2002 272
Income distribution in Holland 1950 and 1988 273
Program used in the analysis on exposed and sheltered sectors 275
A note on Hayek 276
A note on Barrow’s “Impossibility” 278
A constitutional amendment for an Economic Supreme Court 279
A parallel argument on the Central Bank 281
About the US Council of Economic Advisers 282
From the “Employment Act of 1946” 282
Martin Feldstein on the US Council of Economic Advisers 283
Commenting on this 288
Presentation for the National Press in Washington 1993 289
Clinton administration EITC plans for 2000 293
Summaries of additional papers 298
A note on the New Economy (2000) 299
On the 2005 edition of this book 300
Autobiographical note 303
What is new in this analysis ? 305
Abstract 306
Literature 311
Index 323
The basic idea of this book is that Keynes’s General Theory is generalised even further by including endogenous government in the model so that we arrive at a truly general Political Economy. The argument can be presented in a top-down fashion, for example by repeating the IS-LM model before the amendments are introduced. This order appears to be uninviting and therefor the argument is presented in a bottom-up fashion. We better discuss the amendments before we look at the consequences for theory as a whole. We start with the new economic synthesis and the argument for the Economic Supreme Court, since these motivate the book.
Political Economy is the science of the management of the state. More in general, ‘economics’ is Greekish for ‘management theory’. [1] Marshall already explained that ‘economics’ is wider than ‘political economy’, see his “Principles of economics” (1947:43). The proper definitions are:
· Economics ‘in a narrow sense’ puts the approach, methods and tools, of the discipline central, and looks at a variety of subjects.
· Political Economy puts the subject, the management of the state, central.
· Economics ‘in a broad sense’ joins the ‘narrow sense’ and Political Economy.
One way to view these distinctions is to visualize a matrix with the sciences in the rows and the subjects in the columns. The common economist may to some extent neglect the inputs of the other disciplines, but the political economist must draw on the resources of philosophy, history, law, sociology, politicology, social psychology, biology, physics and so on. [2] Political Economy is, just by definition, the study that tries to integrate all human knowledge about the management of the state. Political Economy is, in that respect, the proper continuation of ancient philosophy on that subject matter.
Confusions easily arise when these definitions are not understood. [3]
The reasons to adopt these definitions are rather mundane. The King - and the ruling elite - can derive their wealth (a) from exploitation or (b) from general productivity growth. The latter is more advantageous in the longer run. [4] Productivity can be increased in basically two ways: by technology or by management. For example, computers can add to our wealth, and we must have technology to be able to have computers. But a room full of computers does not present much value if we don’t manage their use. So technology and management are the two sides of the coin of human wealth. Though no study should neglect either side, there of course is advantage in some specialisation of those studies. The engineers take one side, the economists the other.
Psychologists and artists might object to that view, and argue that proper training in enjoyment and in particular the arts could teach people to enjoy life so much more, requiring neither additional engineering nor economics. In a sense, this viewpoint would seem to be correct. In another sense, it apparently isn’t sufficient. Human beings get used to levels of wealth, and require more wealth. It would be economics again to study why people are not happy eating bananas and watching sunsets. And dealing with issues like this, is management again.
Also, when writing this in 2000, and again 2004, there are some rumours about the ‘end of the state’ and the ‘loss of power of existing nation-states’. This clarifies that the definition of ‘Political Economy’ subsequently requires a definition of the ‘state’. I will not try to give that here. [5] For the purposes of this book it suffices to take the existing nation-states, and international governmental bodies, and we can reconsider that assumption when they all drop their constitutions.

Then: The economic process can be understood much better if economic policy making itself is included as one of the factors, and then is studied from the Public Choice perspective. The basic proposition of this book hence is that we can extend the current ‘neoclassical synthesis’ by including endogenous government in the model, so that we arrive at a truly general Political Economy.

This extension causes the subsequent proposition that it would be advisable for a democratic society to create an Economic Supreme Court as a separate power in the constitution next to ‘Trias Politica’ of the Legislative, Executive and Judicial branches.

It is useful to recall that economics does not restrict its attention to ‘income’ only, but also considers rights and duties. Coase’s theorem is a good result in an older tradition. Sen (1999)’s “Development as freedom” is a welcome refresher. Beckerman (1999) explains that when economic growth causes our grandchildren to be wealthier than us anyhow, that we should rather focus on bequeathing a good system of justice rather than try for even more growth. So, it is quite natural in Political Economy to also consider the law.

The basic argument is the following. Governments already have economic planning bureau’s - the US for example have the Council of Economic Advisers to the President. [6] Current forecasts are conditional on the assumption that the government will do as planned and promised. Such forecasts often fail, and can be forecasted to fail if one takes an independent position. Proper forecasting requires that the economic adviser not only has a scientific attitude, but also a scientific position, and is able to tell and indeed tells the public that plans or promises will fail if there is scientific reason for thinking so. Given the experience of the 20th century, it appears that strong constitutional safeguards are required to provide for this public function. Hence an Economic Supreme Court.

Keynes (1936) already formulated a ‘general theory’ for political economy. Keynes subsumed the ‘classical’ approach as a special case. [7]
Keynes’s theory is rich in many respects and poor in other. On the poor side: Keynes’s book is not exact on many issues, and proper models like the IS-LM model were only developed by Hicks, Meade and others. Samuelson (1947) presented the first integration of both the competitive model and the utility maximising calculus, only then giving body to the notion of ‘classical’. [8] However, on the rich side: Keynes’s book was and still is a source of inspiration for new research angles. Note that Samuelson coined the phrase ‘neoclassical synthesis’ for ‘his’ conceptual integration of classical processes at the micro level and Keynesian processes at the macro level. This synthesis endures till today, as e.g. Colignatus (1990a), Blanchard (1999) and Krugman (1999) acknowledge. It is important to note, though, that Samuelson’s phrase is a bit awkward, since Keynes himself already proposed such synthesis - he namely did not abandon micro-economics. It would be wrong to associate Keynes only with the macro-economic leg of the synthesis. Thus the neoclassical synthesis is actually the Keynesian synthesis itself. But we may as well use the phrase ‘neoclassical synthesis’, if only to acknowledge the role of others. [9]

Keynes remains vitally present, not only for reasons of polical economy but also in the standard macro-economic core. A student who considers recent textbooks on economics, such as Mankiw (1992 and 1998) or Dornbusch & Fischer (1994), notes that the core of macro-economics still derives from Keynes (1936) and from the interpretation of his theory by the IS-LM model developed by Hicks (1937) and others. The ongoing discussion since 1936 can only be understood by properly including these original theoretical roots. Krugman gives a useful refresher in his “The return of depression economics” (1999). Flanning & Mahony (1998, 2000) provide a recommendable modern summary companion to The General Theory that is a testimony of its relevance. The theoretical extension with the Phillipscurve in its relation to unemployment and inflation belongs to this tradition. Also practical economic modelling, such as the models Athena and MIMIC of the Dutch Central Planning Bureau rely on that macro-economic core, see CPB (1990) and Graafland and De Mooij (1998).

There are also good reasons to remain modest about the novelty of the ‘new synthesis’ proposed in these pages. Keynes had an open eye to the policy making process and social philosophy. Similarly, Public Choice theorists like Buchanan and Tullock have not suggested that other factors like the macro-economy itself were not important - they only emphasised the importance of Public Choice. In that sense the presently proposed extension with institutional economics, information and Public Choice is no real extension.
In addition, the three pillars of the Trias Politica are not fully independent already. There are rather numerous dependencies instead. A modern nation has decentralised much power, and created hundreds of ‘independent organisations’ - so that some speak about ‘myrias politica’ instead of ‘trias politica’.
However, from the very definition of ‘political economy’ it follows that the function of analysing, theorising and forecasting the management of the state is a part of management itself, and this function indeed can be in danger of the other three branches.
A nation that will adapt its constitution to create an Economic Supreme Court will still feel that it takes a historical step. Similarly, economists would feel the change of perspective. It would be a different world, for example, if the US Council of Economic Advisers to the President would honestly state that they ‘would rather veto the Budget’ if they really would think so; and if they would become subject to criticism from the profession if they wouldn’t start behaving like this. So, speaking about a new synthesis is of major significance. And it can be shown to be crucial.
Methodology appears to be important in this book. Sometimes, paradigm shifts are as much a matter of methodology as a matter of content.
One example is Keynes. As an economist, Keynes emphasised the economic content of his analysis: notably his findings on the peculiar role of money in the economy. His observation is firstly that money is both a medium of exchange and a store of value, and secondly that storage value depends upon expected value: and then his analysis on expectations takes off. In retrospect the force of Keynes’s analysis is a bit less ‘economics’ than he thought, and has more to do with the handling of time than with money per se. Samuelson (1947, 1983:117) and Grandmont (1983) showed that the analysis can be reproduced if money is entered in the utility functions. What remains is the issue of time. From a methodological point of view, Keynes’s theory is general in that it extends economic equilibrium with the notion that market non-clearing disequilibrium such as unemployment could be a state of expectational equilibrium too (a different concept of equilibrium). And money need not be the only cause, witness for example the difficulty of forecasting sales in order to set production. [10] [11] [12]
Another example of the relevance of methodology appears to be Samuelson (1947). Samuelson emphasises his interest in a general theory (that word again) of economic theories, and clarifies that such a theory (i) should apply to various circumstances and (ii) be meaningful (as opposed to being a tautology). Samuelson clearly presents his argument as a methodological one. [13]
Originally, the draft of this book started out with methodology, but this discussion now has been moved downwards, to a place where one will better appreciate its argument and the need for it.
Montesquieu published his De l’Esprit des Lois in 1748. An English translation can be found on the internet, and a short biographical note, taken from there, has been included in an appendix. Though his book discusses many issues, it remained famous for the theory of the separation of powers, i.e. of the Legislative, Executive and Judicial branches of government. The American phrase is ‘checks and balances’. A key passage in Book XI shows that Montesquieu also refers to the existing case of England - so that his role is not one of originator but one of keen observer and developer of theory:
“One nation there is also in the world that has for the direct end of its constitution political liberty. We shall presently examine the principles on which this liberty is founded; if they are sound, liberty will appear in its highest perfection.
To discover political liberty in a constitution, no great labour is requisite. If we are capable of seeing it where it exists, it is soon found, and we need not go far in search of it.
6. Of the Constitution of England. In every government there are three sorts of power: the legislative; the executive in respect to things dependent on the law of nations; and the executive in regard to matters that depend on the civil law.
By virtue of the first, the prince or magistrate enacts temporary or perpetual laws, and amends or abrogates those that have been already enacted. By the second, he makes peace or war, sends or receives embassies, establishes the public security, and provides against invasions. By the third, he punishes criminals, or determines the disputes that arise between individuals. The latter we shall call the judiciary power, and the other simply the executive power of the state.
The political liberty of the subject is a tranquillity of mind arising from the opinion each person has of his safety. In order to have this liberty, it is requisite the government be so constituted as one man need not be afraid of another.
When the legislative and executive powers are united in the same person, or in the same body of magistrates, there can be no liberty; because apprehensions may arise, lest the same monarch or senate should enact tyrannical laws, to execute them in a tyrannical manner.
Again, there is no liberty, if the judiciary power be not separated from the legislative and executive. Were it joined with the legislative, the life and liberty of the subject would be exposed to arbitrary control; for the judge would be then the legislator.
Were it joined to the executive power, the judge might behave with violence and oppression.
There would be an end of everything, were the same man or the same body, whether of the nobles or of the people, to exercise those three powers, that of enacting laws, that of executing the public resolutions, and of trying the causes of individuals.”
It is useful to recall Montesquieu’s definition of political liberty:
“We must have continually present to our minds the difference between independence and liberty. Liberty is a right of doing whatever the laws permit, and if a citizen could do what they forbid he would be no longer possessed of liberty, because all his fellow-citizens would have the same power.”
Thus, of key importance: A person with few means can take less advantage of his liberties than a person with more means. A person with insufficient means might be regarded as not free at all. This brings us to the economic amendment to Montesquieu’s heritage.

There appears to be a clear link between Montesquieu and Adam Smith. In his preface to his edition of Smith (1776; 1974), Skinner explains that Smith used the historic method to provide him with empirical input (rather than econometrics). Quite fittingly, Skinner writes:
“(…) it was Montesquieu rather than Voltaire who provided the most important impetus to their studies. Montesquieu was widely regarded as the ‘greatest genius of the present age’ and his Esprit des Lois came to be enjoy a considerable vogue in the circle of Smith’s friends. But while Montesquieu’s work provided an important stimulus, the Historians in general, and Smith in particular, went well beyond the teaching of the master. In the words of one of their number: ‘The great Montesquieu pointed out the road. He was the Lord Bacon of this brand of philosophy. Dr Smith is the Newton.’” (p30)

The limitations of the Trias Politica with regards to economics are a well-known theme. Marshall’s “Principles of economics” opens with the painful story of poverty - as Mankiw unfortunately waits till p421.
David M. Kennedy (1999:245), “Freedom from Fear; The American people in Depression and War”, quotes Roosevelt in a special message to the US Congress on June 8 1934:
“(…) ‘the interdependence of members of families upon each other and of the families within a small community upon each other’ provided fullfillment and security. But those simple frontier conditions now had disappeared. ‘The complexities of great communities and of organized industry makes less real these simple means of security. Therefor, we are compelled to employ the active interest of the Nation as a whole through government in order to encourage a greater security for each individual who composes it.’ The federal government was established under the Constitution, he recollected, ‘to promote the general welfare,’ and it was now government’s ‘plain duty to provide for that security upon which welfare depends’. (…)”
Unemployment and poverty can be seen as indicators for the quality of the management of the state. They are social phenomena, and thus depend upon the rules that society defines. When they exist, then apparently something is wrong with the management.
The economic record of this century may be judged with mixed feelings. Much has been achieved, but much has gone wrong too:
1. Two World Wars.
2. The Great Depression 1930 - 1940.
3. The Great Stagflation 1970 - the present (2005). [14]
4. Disputable ways for decolonisation and development co-operation.
5. The economic disaster in Russia and Eastern Europe after the Fall of the Berlin Wall.
6. The environment.
Of this record, the wars are the focal points of attention.

Wars are disasters for the common citizen. Perhaps wars need to be fought for political reasons, but, an economist can express some doubt. In fact, Keynes wrote his General Theory with an eye to the threat of war:
“War has several causes. Dictators and others such, to whom war offers, in expectation at least, a pleasurable excitement, find it easy to work on the natural bellicosity of their peoples. But, over and above this, facilitating their task of fanning the popular flame, are the economic causes of war, namely the pressure of population and the competitive struggle for markets. It is the second factor, which probably played a predominant part in the nineteenth century, and might again, that is germane to this discussion.”
John Maynard Keynes, “The General Theory of Employment, Interest and Money”, 1936:381-382

Skidelsky even makes a strong case that it took the War for people to start listening to Keynes:
“In his biography of Keynes, Sir Roy Harrod reports a widely acclaimed speech delivered by his subject to the House of Lords in 1946, the year of his death. ‘But Keynes had been talking in this style ... for some twenty-seven years. Why had his words not been listened to .... ?’ (...) Unemployment as a problem in economic theory may have been sufficient to produce a revolution in the discipline; unemployment was not a sufficient problem to society to produce a revolution in political ideas. If it was not the prolonged experience of mass unemployment that finally broke the hold of nineteenth-century ideas, what was it ? A strong case can be made out for war. ‘Normal’ life could coexist with unemployment; it could not with modern war.”
Robert Skidelsky, “The reception of the Keynesian revolution”, in Milo Keynes, “Essays on John Maynard Keynes”, CUP 1975:89 & 102-103

Kennedy (1999) makes clear that ‘Keynesian’ elements like maintaining aggregate demand were prominent elements in even Herbert Hoover’s policies. Similarly, deliberate inflation was considered by Roosevelt e.g. to help farmers reduce their debt burden. Nevertheless, Kennedy has to write: “In the ninth year of the Great Depression and the sixth year of Roosevelt’s New Deal [i.e. 1938 /TC], with more than ten million workers still unemployed, America had still not found a formula for economic recovery.” (p362) There was contact between Roosevelt and Keynes, but with little effect - Roosevelt apparently regarded Keynes pejoratively as an academic theorist. Then:
“Deprived of adequate public or private means of revival, the economy sputtered on, not reaching the output levels of 1937 until the fateful year of 1941, when the threat of war, not enlightened New Deal policies, compelled government expenditures at levels previously unimaginable.” (p360)
The policy stagnation around 1938 is the more surprising, since Kennedy reports Roosevelt saying on a Fireside Chat at that time (April 14 1938): “History proves that dictatorships do not grow out of strong and successful governments, but out of weak and helpless ones.” (p362)

Keynes is an amazing person also on the following. Skidelsky makes another important point about Keynes’s role in the aftermath of the First World War in turning people’s attention from geopolitical power to economic growth:
“None of this is to deny that The Economic Consequences of the Peace was a very influential book. Of the dozens of accounts of the Treaty which appeared in the 1920s it is the only one which has not sunk without a trace. It captured a mood. It said with great authority, flashing advocacy and moral indignation what ‘educated’ opinion wanted said. It also had an influence at a deeper level. Wickham Steed was right: it was a revolt of economics against politics. The war had been fought in the name of the nation, state, emperor. These, Keynes argued, were false gods, from whom he sought to divert allegiance towards economic tasks. It was a message calculated to appeal to the nation of Cobden and Bright, once it had recovered of its intoxication with military victories. It helped form the outlook of a new generation. The nineteen-twenties saw a new breed of economist-politician, who talked about the gold standard and the balance of trade as fluently as pre-war politicians had talked about the Two-Power standard and the balance of power. (…) The idea that the creation of opulence was the main task of rulers was born in 1919 though it came of age only after the Second World War.” Skidelsky (1983:399). [15]
Reading this, one would tend to think that there still is a risk when politicians get involved with the economy.

The Trias Politica setting is usefully limited to the nation-state. However, if we were to limit our attention to the nation-state, could we really neglect the external conditions ? One would think not. A crucial chapter in the theory of the nation-state concerns the external relations: trade and war by tradition, and then, in our age: the risks of world population growth and of environmental disaster, i.e. risks that may spill over across the border. Wise managers would not close their eyes to external risks. Hence, though this book concentrates on the situation in the Western democracies, we also regard the non-democracies in the developing world.
Projections for the future indicate such external risks:
“The Global Crisis scenario (...) explores the risks and dangers of a neglect of, and late response to regional and global challenges (...) the world may end up in the throes of widespread distress, an eco-crisis, which can only be corrected at high cost. The policy message conveyed by this scenario is abundantly clear. Dismissing this scenario as unduly gloomy and pessimistic is in our view, absurd; such a statement would be tantamount to a complete denial of large segments of twentieth-century history.”
Centraal Planbureau, “Scanning the future”, SDU 1992:211

World population is forecasted in 1999 to rise to 9 billion around 2050, with a forecast error of 1.5 billion lower or higher. The central forecast already is a reduction from a forecast of 9.5 billion as the result of AIDS. This disease not only kills, but also reduces the quality of life for the surviving. Other diseases may well develop. Or, for AIDS itself, given the huge number of infected, a mutation could develop that can be transferred by flies or mosquitos too - that already transfer diseases. Another problem is that when policy succeeds in improving a situation, then such new room tends to be taken up for growth again. So it would be some kind of a miracle if the world would hit the ‘low’ 7.5 billion target with a healthy, well fed, educated and peaceful population.

UNDP administrator Speth correctly states:
“Fifty years after the adoption of the Universal Declaration of Human Rights, one third of the world’s people are enslaved by a poverty so complete that it denies them fundamental rights.” (UNDP 1999 internet site)
This quote usefully recalls to memory that Montesquieu’s liberty has been extended in this century with more rights, so that there is an even stronger intellectual case to test whether the system of Trias Politica serves the demands made on it.
Amartya Sen’s “Development as freedom” (1999) is along this line of reasoning.

The hypothesis of self-interest clarifies that Western nations are less interested in the development issue. Surely, if the Democratic State knew that economic policies were feasible that would make external development Pareto improving rather than wasteful, then it would deem it wise to pursue such a course. And part of the argument in this book is that such knowledge does not get the attention that it deserves. On the other hand, we should presume the lack of that attention, and the lack of sufficient knowledge. But we can still argue that the current world development situation should provide the West with some worry anyhow.
For Western democracies, current situations in the developing world might be regarded as replays of their own past, and as forecasts for their own future - if times of distress were to return again. A 1996 UN-WIDER statement was:
“Thus, man-made crises have become a serious, perhaps the most serious, threat to human security in the present world.” [16]
“Over the last ten years, the number of humanitarian crises has escalated from an average of 20-25 a year to about 65-70, while the number of people affected has risen more than proportionately. The International Red Cross estimates that the number of persons involved is increasing by about ten million a year. As a result, scores of people have been left dead, maimed, starving, displaced, homeless and hopeless. Afghanistan, Bosnia-Herzegovina, Burundi, Cambodia, Central America, Haiti, Liberia, Sierra Leone, Rwanda and Transcaucasia are the countries or regions where the most acute crises have occurred during the last two decades. In turn, Guyana, Kenya, Surinam and Zaire are nations where negative trends in the factors under analysis make many fear that social explosions may take place in the not too distant future, unless corrective measures are introduced urgently.” (idem)
E. Wayne Nafziger (1998), of UN-WIDER, reports in the Financial Times:
“Many people believe that humanitarian disasters are ethnically determined, arising from differences of language, race, tribe or national origin between disputants. These differences, it is thought, are so deeply rooted that they are not amenable to economic and political reform: violence cannot be avoided. That is too pessimistic a conclusion. Our research focuses on the contribution to humanitarian crises of two factors: national income and the role of the government. Both provide some reasons for modest optimism, or at least subjects for action. (…) An analysis of the root causes of humanitarian crises indicates that the mechanism for preventing them are primarily macro-economic.”

Then, there are Russia and Eastern Europe after the Fall of the Berlin Wall in 1989. The risks of turmoil in Russia, while nuclear weapons are abundantly about, were already evident in 1989, and indeed we have seen an attempted coup against Gorbachev and later the bombing of the Duma parliament building. Eastern Europe had the criminal actions of Milosevic. The risks with respect to Russia still exist. Both in 1989 and today in 2004 a reasonable expectation was and is that Eastern developments would and will be positive. But the crucial issue does not concern the average, but the risk. Who understands the economics of unemployment will see that Western economic policy is deficient on this point - a topic that we shall return to.
In the middle of 1999 the UNDP also published a report on Eastern Europe. The conclusion is that there is much more misery than commonly recognised, and that most misery is needless and also a result of wrong decisions by Western governments. In an interview with director Kruiderink, a key question and answer is:
Q: “According to some experts it went wrong precisely since the economic reforms did not go far enough.” A: “Nonsense. The ruin would only have been greater. No, precisely the reform of the state should have been the main target. Some people actually said that ten years ago, but they were not listened to. They were considered to be softies, since they wanted to maintain parts of the communist system. You currently see economists of the Worldbank and IMF slowly change their minds too.” [17]
What is crucial is that the methods, by which such dissenting ‘softies’ were silenced, were unscientific. Crucial policy preparations were left to the fric and fray of politics and bureaucrats, unworthy of a decent democracy.

There is Robert Barro’s research in the relationship between democracy and growth. An early report is in Barro (1996) [18] but he has been working on it since. His results suggest that it first takes a certain level of income before democracy has a chance. This reminds one of the willingness of Westerners to accept dictatorships in developing countries as long as economic welfare is increasing. Four comments can be made: The present discussion is targetted at existing democracies, and Barro’s finding then is only relevant as a warning of what could happen if the risk of, say, an eco-crisis would materialise. Secondly, Barro seems to imply that current democracies are finished, and that there is no next stage. But we can advance. Thirdly, once the concept of an Economic Supreme Court is clear, then one could imagine that a dictatorship on the way to a democracy (notably China) could first install such a Court - and the rule of law - before it moves towards elections. Finally, we should read Sen (1999a) as an answer to the Barro analysis, since it could rather be that democracy futhers development and growth.

Above uses plain human survival to judge on the economic record, it focusses on war, humantarian disasters, overpopulation, diseases, environmental deterioration. It is sobering to regard the more standard economic outcome. Table 1 reviews the unemployment in the European Union for 2003, reassembling the data after the enlargement of May 1 2004.
Table 1. Unemployment in the European Union in 2003
|
Eurostat [19] |
EU (after enlargment) |
EU 15 |
|
Total population |
451 million |
378 million |
|
Unemployed |
19.0 million |
14.2 million |
|
Idem, % labour force (age 15+) |
9.1 % |
8.1 % |
|
Participation [20] |
72.0 % |
72.4 % |
The unemployment figure excludes many welfare state benefit recipients who could work when judged from other standards. For example, there is the well-known case of ‘disability’ with a major fraction of hidden unemployment, see OECD (2003). A hypothesis in public choice theory is that policy makers in the past solved part of their problem with unemployment by allowing an increase in these other welfare programmes. The recent focus in the policy debate is upon increasing participation again, shifting people from such arrangements back into the labour force. This debate however runs into the problem of unemployment again. Disability, sickness, early retirement and welfare relief might be reduced (by reducing problems in the bureaucracy, solving principal-agent problems, and by adjusting definitions, reducing entitlements), yet it might well cause higher unemployment again and thus only shift the problem. A major insight thus is that unemployment remains the root problem for macro-economic policy making. It is proper that we pose the question: why is it that the EU doesn’t achieve more employment ? This question can best be answered by taking a long run point of view - which is not the standard economic point of view.

We can conclude this chapter as follows. The economic record of the last century is mixed, and human suffering was large. For the future: there still are serious risks. Bad economic conditions don’t necessarily result into wars. During the Great Depression the US remained a democracy and didn’t resort to fascism. Though it came close ! [21] Nevertheless, there can be situations in which certain politicians can rise to power by exploiting social, religious and racial sentiments - which sentiments actually draw on economic distress and uncertainty. Such is actually the rule, and stable democracy is rather the exception. Though the probability of such developments might be limited, in the currently affluent West, their costs would be great, and hence the risk may be sufficiently large to try to do something about it. If the system already fails now, what may happen if circumstances would turn out to be far less favourable ?
Since Western societies since the Second World War already have much experience with standard approaches to enhance economic security, and are apparently failing to a large degree, it becomes time to look for a more fundamental approach. We may look into the very process of economic policy making itself.
Since the problem is found to be equal across nations and across time, we may look for common factors. The basic factor that we can identify is the Trias Politica structure of Western democracies. The present checks and balances are imperfect. This structure appears to allow too much leeway for forces that are detrimental to the economic well-being of the population at large, their economic security and their possibilities for the pursuit of happiness. The structure of economic policy making allows politicians, bureaucrats and special interest groups too much room to distort the contribution of economic scientists.
The conceptual scheme of the Trias Politica was a useful ladder to climb out of the situation of feodality and absolute kings. But a ladder is not a goal in itself. Democracy is a living concept and can develop further. If we find that the Trias Politica fail with regards to our needs, then we should adapt it.
In the past there have been two steps towards more independence and more checks and balances in the management of the economy. First there was the independent Central Bank, and then the separate Council of Economic Advisers to the government (or other planning body). Indeed, the situation after the Second World War has been much improved: instead of a Great Depression we only got a Great Stagflation.
Okun (1983), “The economist and Presidential leadership”, provides an recommendable account of current practice. Two quotes are particularly relevant, one that observes current partiality and one that advises impartiality:
“Given these constraints, members of the Council of Economic Advisers are clearly recognized as the President’s men. If they speak publicly, they will be identified as spokesmen for administrative positions.”
“One wishes for a more effective way of influencing public and congressional opinion in the areas of professional consensus. There is a role to be played by a Supreme Court in the profession, although a less important one than that actually fulfilled by the Council and the Bureau of the Budget in recent years.” (p580)
We are advised to go one step further than the current situation, and create a scientific Economic Supreme Court safeguarded within the Constitution as an equal partner next to the three of the Trias Politica. Its role will be limited, but crucial.
The argument is not that politicians could not be qualified in economics. The argument is the balance of power. Having an Economic Supreme Court increases democracy, since it improves the quality of the checks and balances. It caters to the civic right of good government and to the right to know.
The crucial considerations are:
· The first point is theory dependence. The State will decide on its policy while using an economic model. Hence policy is directly dependent upon the state of economic theory. Who is going to decide what the current state of theory is ?
· The second point concerns self-reference (reflexiveness). The model contains a submodel of State instruments. Clarity requires that policy itself is clearly formulated and put into the model too (with error terms to allow for possible discretion).
· The third point is conflictive self-reference. One can conceive the situation that the government announces a policy while the true scientific forecast shows that the policy is untenable and will be repealed later. Hence there is an internal source of conflict - the worst kind, not a dysfunctional person, but a logical knot.
· Finally, there is a ‘general conflict of interests’. Governments have more objectives, and any power group might want to exert its influence anyway.
It follows from this that the Constitution should warrant for the Economic Supreme Court:
· It would be possible for the Court to use a model with an endogenous government. The Court would scientifically forecast government actions, instead of conditionally. The conditional forecast assumption that government promises will be kept and government assumptions realised, will be dropped.
· As the Court will have a scientific base, there can be publications and discussions with different analyses, and these would not by themselves mean a breach of confidentiality.
· The Court cannot exist without some power.
It would suffice for the Court to have the power to veto the national budget if the information that the Executive presents or uses for the budget is scientifically incorrect (in the judgement of the Court). The information and statistics only. The Court will focus on the statement on the deficit and the national debt, since all errors accumulate in those figures - though it can call any number or piece of economic information into question. Parliament of course keeps the power to decide on the budget and on policy. President and Parliament would lose the power to make misleading statements as judged by the Court.
An appendix contains a draft constitutional amendment as an example, to start thinking about it. The appendices also contain a description of the current US Council of Economic Advisers, and the difference should be clear - e.g. where the CEA appears to have no scientific status.
With an Economic Supreme Court in place, a downside is that a nation could get stuck in a specific economic theory. A Court could believe in Monetarism while reality would require something differently. Indeed, Keynes himself addressed his General Theory to his fellow economists, who were as conservative as politicians in rejecting his proposals about fighting the Great Depression. To answer this: Such stagnation in policy making can happen nowadays too, but the situation with a Court is much more transparant. Also, the very job of the Court requires it to pay attention to the data, and this tends to make for eclectic views.
It is useful to indicate in more abstract terms what this book does. Unemployment is not taken as a natural disaster like an earthquake, but regarded as the result of policy. The central questions in the political economy of employment are: can one solve unemployment and poverty, does one know how, and does one want to ?
Next to the budget set and preferences, it appears useful to distinguish information. Government policy making is not guided by prices as markets are. Perceptions play an special role. For example, when policy makers associate tax policy with income distribution policy, and in that manner overlook inefficiencies such as the tax void, then policies are blocked that would otherwise benefit everyone.
Colignatus (1990a) forecasted a revival of institutional economics. We see this happening in the literature indeed. This current book belongs to that development. An Economic Surpreme Court, or the lack of it, is a topic in institutional economics, and thus has a natural position in the proposed new synthesis. [22]
There have been precursors to this approach indeed. Galbraith (1998:199) correctly quotes Michael Kalecki (“Political aspects of full employment”):
“The assumption that a Government will maintain full employment if only it knows how to do it is fallacious.”
Economics is not a finished science. Hicks (1983) even rejects the notion of ‘science’ itself, and writes a chapter with the title ‘A discipline not a science’. (See also below.) He quotes Keynes:
“The Theory of Economics does not furnish a body of settled conclusions immediately applicable to policy. It is a method rather than a doctrine, a technique of thinking, which helps its possessor to draw correct conclusions.”

A joke is that there are as many theories as economists, and five for Keynes. Krugman (1994ab, 1996a) describes eloquently how Western economies came from full employment and a period of great expectations to a period with unemployment and inflation and a productivity slowdown, and as a result diminished expectations. He is even more eloquent in describing the different fashions in economics and economic policy making. He gives a brilliant discussion of Keynesians, Monetarists, Supply-siders, Business-cyclists, Post-Keynesians, Strategic Policy Adepts. Krugman also makes an apt distinction between serious economists and the policy entrepreneurs who abuse economics for schemes of their own. [23]

The discussion by Galbraith (1998) is also very useful to understand the history of economic schools in the last decades. I discuss this book in the final chapters.

There also is ample reason to be humble about econometric testing of theories or identifying regularities (see Hendry (1995)), and then we haven’t started yet on the quality of national statistical data. [24]

If we regard the role of economic theory itself, then we cannot overlook the error that economists made with respect to Arrow’s Theorem in the theory of Social Choice.
First of all, there has been a stagnation in theory development:
“Tullock sees public choice as a subject in which there was a burst of interest from the 1950s to the 1970s, but which has now ‘died out’ (p39). The cause of death was the set of unremittingly negative conclusions that issued from the analysis of the Condorcet and Arrow paradoxes.” Sugden (1999).
Secondly, it turns out that economists and Arrow himself gave a wrong interpretation to the mathematics. Below we will present a novel analysis with respect to the Arrow problem, and show that economists have run astray indeed. This gives another reason to be humble.
But, our discussion also provides clarity that social choice can be based on reasonable and morally attractive axioms. And thus there is a logical basis for a Court too.

Evaluating in general:
· Looking at this circus, it would be wrong to be only entertained. The proper point to see - the real upshot of Krugman’s books - is that the current government structure has little protection against this circus, the fads and fashions, the David Stockmans: and that this protection would be larger with a well selected Court. Note that the word ‘court’ has been chosen judicially: the job of this body is to make a judicious choice, a wise selection of all competing theories and approaches.
· It is useful to realise that the academia basically write for the journals, i.e. each other, and do not necessarily have the focus of analysing or predicting the national economy. Van Bergeijk c.s. (1997) point to these different focusses and the ‘dangers’ thereof. [25] The academic job also is to generate and test new ideas, not only the implementation of accepted theory.
· Another aspect of the distinction between the academia and practical policy advice is that only the first have the luxury of saying that they ‘don’t know it’. In policy advice this luxury basically lacks, and a decision has to be supported with the best information available. Much academic criticism on economic policy advice is overdone, since it does not take this condition into account.
· Also, economics has come far, and many economic models show similarities. So there is a body of ‘existing economics’ or ‘accepted theory’ and a rather firm scientific base. Let me indicate as such: the textbooks of Dornbusch & Fischer (1994), Mankiw (1992), Blanchard & Fischer (1989), Mueller (1989), research like Bruno & Sachs (1985), Layard, Nickell & Jackman (1991), Phelps (1994), and the practical work such as of the Dutch Central Planning Bureau (1990) (in which I participated) and Gelauff (1992). [26] [27] As Montesquieu for his Trias Politica referred to the existing example of England, we can point to Holland, where the Dutch Central Planning Bureau has earned itself a strong position, even to the extent that political parties have their programmes evaluated before elections. One can be severely critical of that CPB, precisely since it is no real Economic Supreme Court, but the current achievement is there, and is an argument for ‘promotion’.

If we regard the arguments for a court again, in the light of this evaluation of the record of economics itself, then:
· The issue is not quite the difference between unfinished science and finished science. Even if economics were to be like engineering with some finished science - like Keynes’s famous dentistry, where it would be easy to switch from one economist to another - then still there are always decisions to be made. How to interprete the data ? Is factor X now crucial or not ? Even if a science is finished, then its application to reality still is an art, and there are differences in the artists. One should realise that choices are made nowadays too, albeit hidden and not in the open, and with less scientific scrutiny as is advisable. Currently we have the President and Parliament deciding what will be the ‘information’ on which policy is based: and only too often they select that kind of presentation that suit their goals rather than the truth. The only suggestion here is to make procedures such that the result better serves democracy.
· It is important to see that we are dealing with a natural monopoly here. When the government has to establish its budget and thereby wants to rely on science, then there has to be an instance at which it is decided what the current state of science is. Even if one would ‘privatise’ forecasting, and have universities compete in bids for the contract, then there still is the decision which university to take for this year. By definition there is a monopolistic situation for that decision maker at that moment.You cannot compete that away. My analysis and advice is to embed that authority in the Constitution, and provide warrants that the critical decisions are taken in scientific manner.
· Thus crucially: If the government on the one hand would desire to use the results of scientific advice for its budget process, and on the other hand would not opt for an Economic Supreme Court, then its definitions would be logically inconsistent, and it would thereby tend to create a cause for dishonesty and improper manoeuvreing and thereby corrupt its processes. [28]
· We should realise that also law is no ‘finished business’. Our ancestors have opted for an independent judiciary, even though there is no unanimity about formulations and interpretations. But precisely since there is no unanimity, we need an institute to make a decision - a court.
· It will also be useful here to recall one of the key aspects of being a scientist: namely the responsibility to make up one’s own mind. The scientist is in this respect as a judge. He or she has to balance all pro’s and contra’s, to review theories and facts, to replay all opinions of the colleagues, and then make a decision as to what he or she believes is the right thing to think. For example, to let one’s opinion to be swayed by the opinions of others is unscientific. Now, in the light of the enormous complexity of an economy, and the additional complexity of human made theories about the economy, many academics have the liberty to choose not to ‘believe’ anything - except the logical consistency of the paper that they read or write. But in policy advice, this luxury, as said, is lacking, and much more scrutiny of what one really believes, in terms of probable effects and such, is required.
Economists can be aware of the problems posed here; but then they tend to look for solutions within the given framework of the Trias Politica:
“There may be a communication problem. Using the words of Cairncross, again: ‘Policymakers as a rule are slightly deaf: there is too much noise’. In other words, there is a need to raise the ‘signal-to-noise’ ratio. One cannot overemphasize the importance of the packaging — the simplicity and saleability of ideas and the need to pursue these in clear and non-technical language using simple diagrams, etc. Moreover, often the more important contributions of economic advisers are in the clarification of the most basic and simple (simple only for us, professionals) concepts (...)” Bruno (1990:276)
The suggestion to my fellow economists is contrary: Thinking within the framework of the Trias Politica rather is a waste of time. It is like working from within astrology to arrive at astronomy.

Above discussion is at the constitutional level. It is about the Trias Politica, the Great Depression and Stagflation, wars, and a suggestion of a constitutional amendment. Alternatively, there also is ‘economics as usual’, about prices and wages, growth and such. Part of the analysis can be presented in terms of ‘economics as usual’ - and then of course much of the political drama is lost. Part of the ‘usual’ argument can be indicated graphically.
Figure 1: Isoquants of national income

Figure 1 shows how national income is produced. Capital and labour combine in a production function and give national income. Capital is aggregated in dollars, labour in personyears. [29]
Let labour supply be LS and the unemployment rate be u. In the unemployment regime 0 only LS (1 - u) work, producing a national income of Y0 in wages and profits. The slope of the tangent gives the price ratio of wages and rents. In regime 1 LS work, producing Y1. The rise of national income from regime 0 to 1 is the increase in efficiency from going from the lower to the higher isoquant. The graph clarifies about the improvement in efficiency that: (a) more people work, (b) total income is higher, (c) average wage costs are lower, indicating lower pressure on prices, (d) hence, when there is unemployment, then there is a possible improvement, that benefits some while it needn’t hurt others.
The story of course doesn’t stop with Figure 1, and is a bit more difficult. Some points need to be developed - just indicative, not extensive:
1. We have to show that (current) unemployment is inefficient indeed, and that it is not caused by technology or globalisation or labour market inflexibility (which would cause it to be a form of efficient unemployment).
2. Wages may fall on average, but the story for each individual is different. We have to deal with heterogeneous labour. And we have to develop the impact on inflation.
3. An econometric problem is that observations are based on observations of LS (1 - u), i.e. on the inefficient area, so that extrapolations towards the true efficiency frontier are difficult, especially when labour is heterogenous.
4. Policy makers tend to see the decision process as a clash of preferences. When a tax reduction is proposed, to tackle unemployment, then this is translated in their minds into terms of the (re-) distribution of income - and then it is quickly opposed. So we have to deal with this source of misunderstanding too.
5. Though above uses a Bergson-Samuelson social welfare function, many economists are hesitant about that approach and refer to Arrow’s theorem. This matter then needs clarification too.

Indeed, I might present much of the argument along these ‘economics as usual’ lines.
But doing that makes part of the problem go away. We no longer see the dead of the two World Wars, the hungry of the Great Depression, the ruined lives of the Great Stagflation. We no longer see the devastation in Russia and many of the Eastern European Countries in the first decade after the Fall of the Berlin Wall. Closing our eyes to these issues, would be closing our eyes to the evidence for the need for an Economic Supreme Court.
The critical observation is: If economics would not confront the serious problems of mankind, it would lose it relevance to democratic policy making, and would rather become disinformation and a veil for anti-democratic policy making. It would become an accomplice in economic policy stagnation.
If economics is a science, then it must regard facts as sacred.
Many economists don’t quite understand this. When they see some unpleasant facts, they run, and start studying something else. Or they live in the corridors of power, and - like politicians - massage the facts, and make those fit the mold of the times. But running from a scary fact shows only a partial understanding of their importance. The proper attitude is to stare at the facts till they don’t go away and till they aren’t scary anymore, and then adjust theory to fit them.
Sometimes it is said that ‘facts’ don’t say much, but that it is the theory that makes them tick. People have lived for ages with the ‘facts’ that the moon is 2D round and shows stages of illumination, but it took them almost as long to accept 3D roundness of heavenly bodies as a theory. Admittedly, it is hard to impossible to pinpoint a ‘fact’ without also invoking theoretical concepts. But it would be wrong to switch to the view that ‘everything is theory’. Facts do exist, they can bite, and economists can be scared by them.
It is scary to economists that economic disaster can be related to the role of economics and economists.

At a crucial moment in his life J.M. Keynes was what we nowadays would be calling a ‘whistleblower’. As a civil servant and senior Treasury representative he served at the Versailles negotiations after the First World War. At a certain moment he resigned, and wrote The Economic Consequences of the Peace (1919). Many people thought that he should have kept silent given his position as (ex-) civil servant, and perhaps this played a role in his never becoming a full professor. I don’t have the intention to resolve this issue. But a valid question is: Would it not have been better if we had had Economic Supreme Courts at that time, that because of their scientific agenda would have put Keynes’s analysis up for discussion, that would have given him more protection, and that would have forced the other branches to answer to some questions ?
Another example is Keynes’s General Theory in 1936. Note that Hicks’s simplification of IS-LM was available in 1937. Then the same questions.
The General Theory itself contains the famous lines: “Practical men, who believe themselves to be quite exempt from any intellectual influences, are usually the slaves of some defunct economist. Madmen in authority, who hear voices in the air, are distilling their frenzy from some academic scribbler of a few years back.” (p383) He continues: “(…) there are not many who are influenced by new theories after they are twenty-five or thirty years of age, so that the ideas which civil servants and politicians and even agitators apply to current events are not likely to be the newest.” Perhaps Keynes would have supported the idea of an Economic Supreme Court that keeps its knowledge up to date.
A third example is Jan Tinbergen’s 1936 model of the Netherlands (vide Barten (1988), with p48 highly amusing). The same questions.
The fourth example involves my own person at the Dutch Central Planning Bureau (CPB) around 1989-1991. This book already wins the argument without mentioning my own experience, but it would not be correct not to mention it. This book presents an analysis that has been suppressed by that bureau with abuse of power - see also my biographical appendix. Then the same questions.

Again, as above, there must be a warning about stagnation. My question “Would it not have been better if we had had Economic Supreme Courts at that time ?” is, admittedly, quite rhetoric, and may tend to sweep away deeper questions. It may suggest ideal Courts that always remain impartial and always come to the rescue. But also a Court can get stuck on misconceptions. Keynes and Tinbergen illustrate the point themselves by the famous criticism of Keynes (1939) of Tinbergen’s method. Two of the leading economists of their times did not agree ! Indeed, this is a powerful argument to make the concept of a Court doubtful. (And they did not disagree on policy - more public works - but rather on methodology.)

Interestingly, Frank Sulloway’s (1996) “Born to rebel” argues, roughly put, that first-borns tend to be more conservative and that later-borns are more open to new scientific findings. Van den Berg (2004) calls this finding into question. But an Economic Supreme Court packed with conservatives could be a recipe for stagnation anyway. [30]
To be sure: my question of ‘would it not have been better if…’ is not intended to be rhetoric, and I grant that a Court at times will be slow to take up a challenge.
There however is a proper analogy: In the same way, occasionally, a fireman is caught causing fires himself. But this does not cause us to abolish the whole fire-department. As said, the appendix contains an example constitutional amendment that tries to find the middle ground, something that is workable and a huge improvement compared to the current situation.
The modern economist entertains a sharp distinction between science and values. This indeed is a proper attitude, and also a crucial instance of the division of labour. It is up to Parliament and the President to set the course and make the value judgements - and once the ship’s course has been set, economists will build the ship, rig the sails and do whatever necessary to get there. [31]
It is interesting to observe however that economists regularly express values. It is well-known that Marshall and Tinbergen were drawn to the subject out of a desire to understand the causes of poverty and ‘do’ something about it. Less well known may be this quote of Pigou:
“I would add one word for any student beginning economic study who may be discouraged by the severity of the effort which the study, as he will find it exemplified here, seems to require of him. The complicated analyses which economists endeavour to carry through are not mere gymnastic. They are instruments for the bettering of human life. The misery and squalor that surround us, the injurious luxury of some wealthy families, the terrible uncertainty overshadowing many families of the poor---these are evils too plain to be ignored. By the knowledge that our science seeks it is possible that they may be restrained. Out of darkness light! To search for it is the task, to find it perhaps the prize, which the “dismal science of Political Economy” offers to those who face its discipline.” --- A. C. Pigou [32]
Keynes wrote the General Theory not only motivated by the beauty of economic theory itself but also against the backdrop of the Great Depression and the threat of communism and facism, and war. He even presented the GT somewhat in the fashion of ‘either you accept my theory or there will be a world revolution’:
“The authoritarian state systems of to-day seem to solve the problem of unemployment at the expense of efficiency and freedom. It is certain that the world will not much longer tolerate the unemployment which, apart from brief intervals of excitement, is associated - and, in my opinion, inevitably associated - with present-day capitalistic individualism. But it may be possible by a right analysis of the problem to cure the disease whilst preserving efficiency and freedom.” - GT:381
What do we make of these value judgements ? Do these economists cross the line ? Do they wander in the perk reserved for politics ?
The answer is no. They only emphasise that society may be well willing to do something decent about unemployment and poverty, if only people had the knowledge. If the knowledge is lacking, then society faces a tough choice, and people in power will tend to look after themselves first. But with the knowledge, the situation is entirely different, and even those in power will be quite ready to help create the new prosperity. By doing so, they may also become popular, and gain or retain power. Note that it is not obvious or self-evident that the powerful will allow such change, but they might be persuaded to it.
Of course, in a sense, it could be considered a political act, when one provides crucial knowledge that changes a situation. But properly seen, this is just the definition of a scientist: to provide knowledge. Scientists can be knowledge (power) brokers - see also Throgmorton (1991). If one does not like this role of scientists, then throw out Montesquieu too.
In the same manner the economist can, with his or her knowledge, elucidate the moral problems of society. People may not be aware of certain choices that they implicitly make, and they will be grateful - though not necessarily happy at the first instance when responsibility dawns on them - when these choices are pointed out. The economist then again is only helpful in clarification. Though of course it is often wise to only try to clarify matters if one can predict that this will cause a change - otherwise much discussion and sweat will have been for nothing.
But clearly, the economist has to be protected by the Constitution to be able to perform his or her task of clarification, since new or seemingly contrary ideas always run the risk of misunderstanding and disproportional reaction.

My analysis in 1990 was, vide Colignatus (1990a), and the first edition of this book in 2000 stated:
“In my analysis the moral imperative for the Western nations since the Fall of the Berlin Wall is to help the Russian and Eastern European peoples to recover from the brutal communist oppression that they have suffered. The best way to help is to allow trade. But the West is afraid for cheap products, and thus its own unemployment. And hence there are barriers to trade again. But the true cause of unemployment is not external, but internal to the West, internal in our system of economic policy making. It is the West’s own stupidity that causes hurt to others.”
The second edition of this book in 2005 witnesses the Enlargement of the European Union on May 1 2004. This is a great step in the right direction. There are still obstacles, however, if not internally to the EU then externally to the other nations.
The argument thus has not changed fundamentally.
Hence, the moral imperative for Western nations is to
reconsider the Trias Politica structure of economic policy making. [33]
In ‘economics as usual’ we neglect the World Wars and concentrate on the current problem of stagflation. This book then also provides a novel explanation in this area - novel in the sense that it bundles the articles that have been written since 1989.
In the years after World War II, Western societies created systems of social security - the ‘Welfare State’ - and for a while it seemed as if they could do so without serious economic consequences. From a macro-economic point of view, they hoped to enjoy growth, full employment and low inflation. These indeed happened in the golden years 1950-1970. However, there arose the problem of stagflation around 1970, i.e. the combination of high unemployment, high inflation and stagnating growth. Around 1980, unemployment and inflation reached double digit values. Other economic indicators in the red were budget deficits, high interest rates, and the crowding out of private investments. Adjustment to these problems has been difficult and slow. The economic performance around 2004 is a major improvement from the worst episode, but the progress seems to be stagnating. The ongoing discussion in policy making circles during all these years is how the Welfare State arrangements are related to these economic problems, and what the proper policy reaction should be.
Welfare state economics differs from ‘traditional’ macro-economics in that there are more arrangements that protect individuals from insecurity and that entitle them to benefits. Welfare state economics however does not differ from ‘traditional’ macro-economics in the respect that the basic laws of economics cannot be changed. Generous as arrangements can be, people fundamentally still react to incentives. Welfare state arrangements tend to reduce the base of the economy of those participating in the workforce and they increase the burden on those. The welfare state also tends to generate more unemployment and inflation. While unemployment would ‘traditionally’ cause people to lose their income and thus to be more cautious with their wage demands, in the welfare state they receive an unemployment benefit and may continue tot insist on high wages. These points can readily be verified from comparing the results of the EU and US economies, where the EU is more of a welfare state and where the US has more traditional features.
Not surprisingly, there has been much debate about the sustainability of the welfare state. The US economy clearly is more dynamic and in many respects also more successful and innovative than the European economy. In this debate, a wide range of issues is discussed, from trade to investments, technology, monetary policy, migration, and so on. All these features indeed are very important for a balanced economic judgement. A common conclusion remains that employment plays a key role, as is for example witnessed by the OECD (1994) “Jobs Study”, the OECD Economic Studies 31 (2000), OECD (2003), to name a few. This conclusion actually is not so surprising, since the very definition of the welfare state suggests that it tries to protect people from the uncertainties of the job market rather than anything else.
Many people accept these days that Western economies have a problem with jobs with a low level of productivity and thus a low level of market-earned income. The United States tolerate more poverty while Europe sets its minimum wage much higher so that Europa has more unemployment. This problem with low productivity jobs finds various explanations, notably those of technology, globalisation and labour market inflexibility - or ‘welfare state sclerosis’. Policies based on these explanations have been enacted for some time now. For quite some time, in fact; while little is being achieved. It is proper that we pose the question: why is it that we don’t achieve much ? [34]
The novel analysis presented in these pages finds the problem and answer in taxes. [35] As noted, benefits have to be financed, and the tax arrangements have a key impact on incentives and costs. We will focus on the influence of taxes that runs via the labour market, both directly by ‘labour taxes’ and indirectly by ‘consumption taxes’ that also affect the cost of labour. The emphasis in our study is on dynamics where interactions have more time to take hold. The idea of this present study is that by proper management of tax dynamics, the economy could become more efficient, in both the EU and US alike, so that ultimately the drawbacks of a welfare state can find a better balance with its advantages.
Obviously, when this analysis is new, then it has not been recognised before, and then it has likely been missing in policy. And policy that was based on a wrong analysis, is likely to have been the cause of the very problem that it wanted to solve.
The emphasis on taxes does not mean that technology, international trade and labour market inflexibilities are irrelevant. It does not mean that we can throw away the current macro-economic models. On the contrary: the emphasis on taxes is only an amendment to the current models. The tax analysis would be meaningless without these current models. I myself participated in the construction of the CPB (1990) Athena model, a sectoral model of the Dutch economy with 7000 variables, and I would be the last one to suggest that only taxes matter !
Though the amendment sounds simple, there still are grounds to cover. Unemployment obviously has a much longer history than the current problem. Also, the Western track record on unemployment can only be understood when the record on inflation is taken into account too. A wrong diagnosis of the cause of unemployment would also have its effects via the anti-inflation policy of the monetary authorities.
Consider the empirical evidence since 1950. This track record coincides with decades:
· The 1950s had low unemployment and low inflation, and high real growth.
· The 1960s had the threat of unemployment, and governments accommodating inflation in order to actually prevent it.
· The 1970s nevertheless had mass unemployment bursting into the open, and governments accommodating high and accelerating inflation to battle it. Growth is volatile.
· The 1980s had governments come down hard on inflation, while they accept high levels of unemployment and stagnating growth as the price for stability.
· The 1990s-till-now: There are different reactions on both sides of the Atlantic. Europe appears reluctant to dress down the welfare state, accepts high minimum wages and more unemployment that is partly hidden in Welfare State programmes. The USA appears willing to accept more poverty. (This difference in regional reactions started already earlier, but is clearest in this period.)
One sees a certain “trade-off” between unemployment and inflation. Figure 2 reviews the official data for the United States and Figure 3 for the Netherlands for 1950-2001. [36] For both countries, the official values for the 1950s and 2000s are in the same lower left and favourable region, but they have been far outside of it during the years in-between. [37] Since the official statistics in the 2000s have returned to the favourable lower left region, the natural question to ask is whether stagflation has been defeated. Figure 4 reviews the situation in the Netherlands, where the official values have been extended with those on the labour force ‘not working’. [38] One can suspect that Welfare State programmes can hide unemployment.
In macro-economics, the relation between unemployment and inflation is expressed in the Phillipscurve. Next to the standard (wage-) Phillipscurve there is the (price-) Phillipscurve that gives the relationship between unemployment and (consumer) prices (and that relies upon a dependence of prices on wage-costs). A more extensive (participation-) Phillipscurve links the development of wages and prices to unemployment or ‘not-working in general’. Understanding the relationships of the curves is subtle: it is not just the inclusion of the numbers, but rather the effect on the market. Notably, when ‘disability’ means a reduction of the workforce, the remaining workers face less competition and might raise their wage demands (see Figure 4).
Figure 2. The unemployment - inflation space 1950-2001, United States

Figure 3. The unemployment - inflation space 1950-2001, Holland

Figure 4. The Netherlands, ‘official unemployment’ (drawn) and ‘not working’ (dashed)

Above rough division in decades suggests, as said, some ‘trade-off’. There is a discussion among economists whether such a ‘trade-off’ really exists, and in particular for the short run, but, with this division in decades, it cannot be denied that there are some systematic choices involved. Our object of study, stagflation, can be rephrased by observing that the Phillipscurve apparently has shifted to a higher and unfavourable position.
The authors Okun (1981), Hebden (1983), Blanchard & Fischer (1989), Friedman (1991), Phelps (1994) help to put the Phillipscurve in perspective. Extensive empirical work has been done by the Central Planning Bureau (1992a&b).
Okun (1981) emphasises the stability of the US Phillipscurve over the 1954-1969 period, but accepts that wages and prices thereafter are less flexible in the short run, due to ‘implicit contracts’ and ‘invisible handshakes’. Referring to Friedman and Phelps he notes: “In the sense that all economists must recognize that adverse shift of the short-run Phillips curve, they have all become accelerationists now (to reverse Friedman’s celebrated concession to Keynes).” (p239). Rather than getting lost in finding proper functional formats, Okun concentrates on formulating various elements that are important for policy making, indicating that a whole range of instruments must be used. The minimum wage gets short mention, but is not considered in relation to the Phillipscurve.
Hebden (1983) gives a recommendable review of econometric issues and empirical work (till that time) on the Phillipscurve, including (a) the original article by Phillips, (b) papers that remain close to his format, and (c) papers that include trade union influence and price expectations. Hebden notes:
“Models that seek to explain the causes of the inflation that has been experienced in the recent past, and hold out the possibility of helping economists to predict and maybe control inflation in the future, are sought after eagerly by economists and politicians. Many models have been produced and a fair degree of unanimity has been found as to the mechanics of the relatively mild inflation experienced in Britain in the 1950s and 1960s. But when inflation accelerated, in this country as in most of the industralised world, in the mid-1970s, those models were unable to cope; and though almost a decade of ‘hyperinflation’ has passed since then, no model that adequately explains its causes has yet been found.” (p158)
Blanchard & Fischer (1989) note:
“The Keynesian framework, embodied in the “neoclassical synthesis”, which dominated the field until the mid-1970s, is in theoretical crisis, searching for microfoundations; no new theory has emerged to dominate the field, and the time is one of explorations in several directions with the unity of the field apparent mainly in the set of questions being studied.” (p27).
On the Phillipscurve they note:
“The contemporaneous correlation between innovations in wage inflation and GNP is, however, positive and significant: it is this correlation that underlies the Phillips curve, which plays a central role in theories of the business cycle that allow aggregate demand disturbances to affect output.” (p19). [39]
Their discussion is critical and enlightening, but does not involve the role of the minimum wage. On p551 they discuss the high European unemployment, but then refer to the Layard & Nickell 1986 & 1987 model, concluding, a bit non-committingly:
“The Layard-Nickell model provides an example of how to relate the theories developed in this book to the data. It suggests a complex set of causes for high unemployment in which both demand and supply factors play a role and the labor market’s own dynamics explain the persistence of high unemployment with nearly stable inflation.” (p555).
Our analysis will allow a stronger conclusion. From the 1950s till the beginning of the 1990s the common view among economists and policy makers tended to be that the unemployment in the trade-off was “general” unemployment. This is not quite true for all economists, but many made this simplifying assumption. Nowadays we tend to link unemployment to lowly productive labour. For us it may be obvious, but compared to the earlier view of many it is a change of perspective that the once-thought-to-be “general” unemployment now turns up as a rather specific type. To make this change specific: we will hold that the unemployment in the trade-off has always been related to the distribution of productivity across labour.
The crucial insight is that the people who can demand pay rises need not be the people who run the risk of unemployment thereof. High productivity workers run less risk of unemployment and can more easily demand pay rises, while low productivity workers run the larger risk of unemployment. High productivity workers are more versatile and are able to shift the risk of unemployment to the lower income groups. When jobs are scarce, the high productivity workers even crowd out others from the labour market. [40]
The policy rule on taxes is: don’t tax low productivity labour. Why ? To keep it employed so that more productive labour will meet more competition and will not demand inflationary pay rises. In Europe, taxes on low productive labour are still high, causing a high minimum wage that causes unemployment. These taxes could be abolished, and without costs, since these workers are unemployed anyway. Similarly, marginal tax rates are less a problem than often said. The proposed alternative policy provides an improvement on both unemployment and inflation, exactly the kind of policy measure required for in the current situation.
This analysis is not common knowledge. It is missing in the economic journals, it is missing for example in Borjas’s (1996) much used textbook for undergraduates. Borjas (1996:441) states: “The minimum wage, however, affects mainly less-skilled young workers, so it is difficult to attribute much of the unemployment problem to minimum wage legislation.” [41] For policy makers, the OECD (1998) reports: “The cross-country evidence suggests that the minimum wage has no significant impact on overall adult employment.” though OECD (2000) is more guarded, see chapter 44. We will show however that a minimum wage can have huge ‘multipliers’.
It is useful to clarify the difference between currect macro-economic policy in Western nations and what macro-economic policy can be according to this book.
Current macro-economic policy:
· accepts unemployment as a consequence of low inflation and reduced deficits
· sees the likely cause of unemployment in technology, globalisation and labour market inflexibility
· focusses on aggregates and averages
· discusses the distribution of wages mainly in terms of income (in-) equality.
The new macro-economic policy:
· sees a way to combine low inflation and balanced budgets with full employment
· sees the cause of current unemployment in the system of taxation
· focuses on distributions
· discusses the distribution of wages in its relation to productivity and unemployment.
Table 2 tabulates the differences.
Table 2: Differences between current and possible policy
|
|
Current policy |
Possible policy |
|
low inflation & low deficit |
accepts unemployment |
full employment |
|
cause of unemployment |
technology, globalisation and labour market inflexibility |
system of taxation |
|
method |
aggregates & averages |
distributions |
|
distribution of wages |
income equality |
productivity & unemployment |
The new analysis means that we get a different perspective on the existing models.
For example, a current argument in Holland on labour market inflexibilities is that the replacement rate is too low. There would be a so-called poverty trap. People in a benefit situation would have little incentives to accept a job offer, since they would earn hardly more. This is regarded as a supply issue, and since one cannot raise wages (which would increase unemployment), the only solution seems to be the reduction of benefits. This was actually the statement of the Dutch Minister of Social Affairs at the presentation of the Dutch National Budget in September 1999. Even the small Socialist Party (SP) accepts this view, vide its January 2000 internet site. The Minister and the oppostion party however are misguided and badly advised. In the proper analysis the problem is crucially different. If there would be sufficient jobs then there already are regulations that people can be fined for not accepting a job offer. This fine creates an incentive of 30% in a warning stage and eventually 100% by full withdrawal of the benefit. So the problem is rather that there are insufficient job offers - with sounds more like a demand problem. By manipulating taxes, it is possible to reduce gross wage costs - and increase demand - while still allowing for a decent net income.
Another point of attention is the word ‘unemployment’. Holland in 1999 features an ‘official unemployment rate’ of about 3.2 %. It seems as if unemployment is no problem for Holland. As an economist I however cannot accept the sausage that the Statistical Office (in this case the Dutch CPB and CBS) here present. (1) Dutch ‘official disability’ is about 10% of the true labour force, (2) people older than 55 years are often excluded from the ‘official labour force’ too, (3) many people work part-time since they cannot find a full-time job, (4) many women will not work outdoors since childcare is too expensive because of the wrong wages, (5) etcetera. Many economists classify these issues under the denominator of ‘participation’, and then agree that Holland has a participation problem. However, in proper economic terms it is unemployment: people who would want jobs but cannot find them. I urge the statisticians to remain servient to economic science, as they claim they are, rather than servient to politics and disinformation.
Let us see in stylized fashion how it went wrong in 1950-2005. Our discussion uses Holland as the example to clarify the general OECD situation. The discussion will also use simplifying assumptions and few footnotes, to keep the text transparant. These defects will be remedied in the subsequent chapters.
Key aspects are:
· heterogeneous labour, and the use of an earnings distribution
· the minimum wage and unemployment
· decomposition of the minimum wage in subsistence and tax burden
· analysis of the Tax Void
· differential indexation
· dynamic marginal tax rates
· consequences for the macro model: spillover and domino effects.
Figure 5: Earnings distribution

Figure 5 gives an earnings distribution of a standard lognormal shape. The figure approximates the situation in Holland 2002, though without parttimers. With each level of income there is a number of ‘personsyears’ of people who earn that level. The earnings distribution can be used to compute how large unemployment will be below the minimum wage. Figure 6 gives the situation for the Dutch minimum wage of about € 18.3 thousand. Since Dutch unemployment is about 25% of a potential labour force of 8 million people, the graph conforms to the facts. [42]
Figure 6: Unemployment below the minimum wage

We wonder how the minimum wage comes about. We see two terms in the minimum wage, as can be seen in equation (13.1a) and its explanation:
|
M = minimum wage [43] B = subsistence [44] T = arbitrary tax function Bentham = Bentham tax function [45] y = an arbitrary level of income r = marginal rate x = exemption |
(13.1a) M = B + T[M]
(13.1b) Bentham[y] = r (y - x) for y > x,
= 0 for y
(13.1c) Net[y] = y - T[y] |
The minimum wage provides subsistence and thus consists of that net minimum and the taxes at that minimum, which is expressed by (13.1a). Since net income must be larger than B, this means for the Bentham function:
y - r (y - x)
B & equality
at M
M = (B - r x) / (1 - r)
Malthus has subsistence B enforced by nature. Under current rules of (European) welfare states, B can be higher, since people who cannot earn subsistence B are entitled to a benefit of that level. [46] Table 3 gives the Dutch example.
Table 3: Tax wedge at subsistence (single person)
|
Dutch legal minimum wage 2002 (per annum) |
€ |
|
Gross minimum wage in the official statute |
15,638 |
|
Net, after deduction of taxes incl. premiums for the employee (single person) |
12,516 |
|
Gross minimum wage: gross + premiums for the employer |
18,265 |
|
All taxes incl. premiums (though exclusive of VAT etc.) |
5,749 |
|
Tax as a percentage of gross minimum wage |
31.5 % |
|
Tax as a percentage of net income |
45.9 % |
The Dutch situation is depicted in Figure 7, the tax plot. The horizontal axis gives income y, the vertical axis the tax t. The tax line T[y] gives the Dutch tax brackets. Net income is given by the difference between the tax and the 45-degrees line (t = y). Subsistence causes the line y - B parallel to the 45-degrees line. This line cuts off a part of net income. The intersection of the subsistence and tax lines gives y - B = T[y], and this solves into the minimum wage y = M. You must earn at least M to satisfy the minimum net income requirement B.

Figure 8 clarifies that the minimum wage means that there are no full time wage earners below M, so that tax and net income are only relevant above it.

Figure 9 gives gives the same result but then as a net income plot. The horizontal axis gives income, the vertical axis net income. The tax is given by the difference between net income and the 45-degrees line. Subsistence now is a horizontal line at B. The intersection of the B-line and the net income line gives the minimum wage M. You must earn at least M to satisfy the minimum net income requirement B.

Let us now combine the earnings distribution and the tax plot.
Note that the tax figures have shaded areas only above the minimum wage. The tax appears effective at and above the minimum wage, but not below it. Though taxes are defined below the minimum wage, there are no taxes collected, since people are unemployed below the minimum wage. The clear area from net minimum till the gross minimum wage M can be called the Tax Void.
The difference between net and gross is called the tax wedge, and it is generally seen as a vertical jump. There is a change of perspective now, in that we see it also as a range, particularly relevant for the minimum wage.
In the Tax Void the tax code has only a paper function (in terms of tax collection). The tax code helps to drive up the minimum wage, but it does not collect any revenue. Abolishing taxes in this area therefor does not cost anything too. Note that abolishing the tax void would mean that exemption would be chosen at subsistence.
Figure 10: Tax Void Unemployment

Part of unemployment below the minimum wage is still above subsistence. If taxes would be abolished in that section, then the affected people could still earn a living wage, and need no income support. This kind of unemployment can be called the Tax Void Unemployment. Figure 10 gives a plot of that section (shaded) for Holland.
For the record: the Dutch minimum wage only holds for fulltimers, and not for parttimers. Holland has a lot of parttime work (for that reason). We have eliminated parttimers from the present analysis.
How has the tax void come about ? Since abolishing the tax void does not cost anything, and would generate a lot of employment, why don’t we abolish it ? Why do we continue the present absurd situation of mass unemployment ?
It appears that the situation has come about gradually, by a mechanism that is difficult to observe directly. It involves the co-ordination of tax policy with social policy, specifically the indexation of taxes and subsistence.
First note that OECD countries adjust their taxes for inflation, see OECD (1986). Tax exemption in 2002 will often be close to the inflation-adjusted real value of 1950. On the other hand, research in social psychology shows that subsistence tends to rise with the general level of income, the growth of which consists of inflation and real growth (or real net income). So there is “differential indexation”. In the 1950s exemption was pretty close to subsistence, so that there was no void to speak of. Since then, exemption has lagged behind the standard of living. When tax exemption lags behind net subsistence, then there is a multiplier effect on gross subsistence, with an accelerated increase of the tax void. This process also explains the ‘squeezing of income differentials’ in OECD countries.
Holland is the example again. In 1951, exemption for a single person household was € 354 and for a couple without childern € 463. At that time there was no official minimum wage, but it can be taken at that value. The price level in 2002 (1951=1) is 6.25 and the wage index 2002 is 25.59. This allows us to construct Table 4.
Table 4: Development of tax exemption in Holland
|
Euro’s |
1951 |
1997 |
2002 |
|
Inflation index (%) |
100 |
545 |
625 |
|
Wage index (%) |
100 |
2082 |
2559 |
|
Exemption, single person |
354 |
3223 |
8025 |
|
Idem, price adjusted |
354 |
1930 |
2211 |
|
Idem, wage adjusted |
354 |
7369 |
9060 |
|
Exemption, couple without children |
463 |
6445 |
*13116 |
|
Idem, price adjusted |
463 |
2524 |
2892 |
|
Idem, wage adjusted |
463 |
9638 |
11850 |
* Dutch readers can find the computation in Colignatus & Hulst (2003)
Till 1997, official exemption € 3223 lagged strongly behind the wage adjusted 1951 value € 7369. In recent years the gap has been reduced, but the 2002 official exemption of € 8025 still lags more than € 1000 behind the wage adjusted 1951 value. Most important, it lags € 4500 behind the (single person) net minimum wage of € 12500.
If we index tax parameters on inflation only, then this affects exemption x in the Bentham tax function, and thus x should be included in the function call.
|
P = price index x[0] = exemption at the xi = real exemption index |
(13.2a) x = x[0] xi P (and here xi = 1)
(13.1b’) Bentham[y, x]
(13.2b) Bentham[y, x[0] P] = r (y - x[0] P) |
We also write the tax function as T[y, x] and net income as Net[y, x].
The indexation of subsistence differs from other incomes. When wages follow, on average, an index wi, the real subsistence index rsi commonly follows the net average wage, i.e. the wage after taxes.
|
W = the average wage (nominal) W[0] = the average wage in wi = wage index = W / W[0] rwi = real wage index = wi / P B[0] = subsistence in the base year h = B[0] / W[0] rsi = real subsistence index rnai = real net average wage index |
(13.3a) W = W[0] wi = W[0] rwi P
(13.3b) Subsistence = B = B[0] rsi P
(13.3c) rsi = rnai =
|
We choose the base year so that x[0] = B[0]. Let W[0] be the average wage in the base year, and let h = B[0] / W[0] be the base year ratio with subsistence. Then the index of real (net) subsistence rsi is set to the index of the real net average wage rnai, and is (proving (13.3d)):

with B[0] = W[0] h:
(13.3d)
For example, if base subsistence is half the base year average wage, B[0] = ½ W[0] then h =0.5. When r = 0.5 then rsi = 0.33 + 0.67 rwi.
With h and B[0] given, the causal chain
is {rwi, r}
rsi
B
M
u. [47]
Before we continue it is useful, however, to first clarify a formal property for the Bentham tax function.
Property (13.3e): For the Bentham tax function: There is equal growth of gross and net income, if and only if exemption is indexed on either.
Note: The distinction between (13.3d) and (13.3e) is that the former indexes x[0] on P only, and the latter indexes x[0] and B[0] on wi = P rwi.
Corrollary: Under (13.3e): If the income distribution remains the same (all incomes grow with the same rate) then also the average income, y = W grows at the same rate, and then also the net income distribution remains the same, and then the ratio of net average to subsistence remains the same too. Note: Western nations thus could wisely index subsistence and exemption on gross average income.
Note: These relations seem obvious enough, but actually proving it turned out to be a bit tedious.
Proof: Denote y[+1] = (1+gr) y = g y for growth rate gr, and Net[y[+1]] = n Net[y] (both g and n one period indices).
Net income with the Bentham tax is Net[y[+1]] = g y - r (g y - X) with X the new exemption. This should be equal to n Net[y] = n (y - r (y - x)). Thus n is defined by:
g y - r (g y - X) = n (y - r (y - x))
(
)
Take z = g = n. Then z y - r z y + r X = z (y - r y + r x) and
this gives X = z x.
(
g)
Take X = g x. Then g y - r(g y - g x)) = n (y - r (y - x)), so
that n = g.
(
n)
Take X = n x. Then
g y - r(g y - n x)) = n (y - r (y - x))
g y - r g y + r n x = n y - n r y + n r x
g y - r g y = n y - n r y
g (1 - r) y = n (1 - r) y
g = n
Q.E.D.
These formulas call for a graphical illustration. We only need data on rwi, r and h for a stylized display. We will take r = h = 50%. Then we need data on rwi, and we can use our example of Holland.
Appendix Table 20 gives the required data on the Dutch economy. Dutch 1951 exemption can be taken as 1951 subsistence. Before we use the data for the formula, let us first see what they mean. Figure 11 and Figure 12 on inflation P and real income growth rwi = wi / P show that the data fit above classification of subperiods for inflation and real income growth behaviour.
Figure 11: Continued inflation, stagnating real wage
Holland, 1951 = 1

Figure 12: Inflation plotted against the real wage
Holland, 1951 = 1

We now use the data for our analysis. There are four combinations of gross/net and real/nominal. This results into Figure 13. ‘Subsistence’ is always measured as a net term, and ‘minimum wage’ as a gross term. For Holland, we find that real subsistence has risen about 4-fold since 1951, and the nominal minimum wage more than 30-fold. The computed nominal minimum wage relates well to the factual 2002 minimum wage. Not only inflation accounts for the rise, but also an increased tax burden (that encounters inflation again).
Figure 13: Different indices at the minimum [48]
Holland, 1951 = 1

It was the slow rise of subsistence B and the lagging of exemption x in the 1950-1975 period that caused a multiplied rise of M, creating the Tax Void. Also, since the earnings distribution is nonlinear (lognormal), there was an even sharper nonlinear increase in unemployment.
Figure 13 shows that the real values stagnate since about 1980, and that the development since then is determined by inflation. Since inflation does not occur in the rsi index, the real situation is stable. For example, the gross-to-net ratio at the minimum since 1980 is quite constant.
Note too that this in a sense presents a difficulty. The problem with the minimum wage was caused before 1980, and policy makers wanting a solution in 2002 will rather look at the last decennium rather than to the 1950-1975 period.
While the above considers exemption x, the analysis can be extended with an analysis on the marginal tax rate r.
Many economists hold that a high marginal tax rate is a disincentive for labour effort. They frequently propose a change from the income tax to the Value Added Tax (VAT). If we assume the same total tax revenue then the VAT might allow for a lower marginal tax rate, for the reason that the VAT has no exemption. At least, that is commonly conjectured.
Above analysis already exposes one flaw to the argument ‘in favor of the VAT’. Having no exemption means a higher minimum wage ! So, those tax theorists who propose a shift from income tax to VAT tend to neglect an important part of labour market economics. Note that a higher VAT on luxuy cars does not affect the subsistence worker who cannot afford these, and hence there is some truth in the statement that a VAT sometimes can be preferred. However, once we have solved unemployment by proper labour market policies, the discussion about income tax or VAT could be done in terms of fiscal properties only, and it might quickly appear that a low VAT of say 5% suffices. [49]
Secondly, it is said that a VAT taxes profits too and thus seems to allow a general reduction of the price of labour. But it raises costs disproportionally for the lowly productive (who generally work with less capital).
Figure 14 shows the development of the relative revenue shares of Dutch income tax and VAT for a selection of years (i.e. 1975, 1980, 1985, 1990, 1997 and 2003). The Dutch minimum wage problem has worsened also by this development.
Figure 14: Revenue shares of income tax and VAT

I agree with the basic idea about the disincentive effects of marginal tax rates. Namely, economic theory assumes maximising agents, and the condition for a maximum can normally be expressed in terms of marginals. However, the marginal must be computed correctly. Above marginal rate r is only a static rate, that applies to a specific regime, for example a specific period. However, tax rates are adjusted from year to year. A dynamic situation requires a dynamic analysis.
Let
y
= y - y[-1]. Then the proper (dynamic) marginal tax rate is DMR
=
T /
y. For the Bentham function:
![]()
Generally the dynamic marginal is lower than the static marginal. In fact, when tax parameters are indexed in a certain way, then the tax can have the same growth rate as income, and then the dynamic marginal rate equals the average tax rate. This holds for individuals and for the macro data if all individuals are on a balanced growth path. Let the balanced growth rate be bgr:
(13.4)
The following is a small example of how a dynamic marginal rate can equal a normal average. Let exemption be $10000, and let the statutory marginal rate thereafter be 50%. Someone earning $50000 pays the tax of $20000, on average 40%. Let all incomes grow 5%, and exemption be indexed on national income. Then exemption becomes $10500, income $52500, tax $21000, again 40%. Thus on the (dynamic) “marginal dollar” this person doesn’t pay 50% but 40%.
For the Bentham tax function we can derive a simple
expression for individual growth. We are most interested in expected
developments. Let personal income grow by rate
, so that y[+1] = (1 +
) y, and let
exemption be expected to be adjusted by rate
, so that x[+1] = (1 +
) x. Then we
find:
![]()
Let us regard the dynamic marginal rate for a Dutchman in 2002 who considers an increase in work effort for 2003 (and beyond), and let us assume a regime of sound economics. In the ideal case, exemption in the base year is put at subsistence, in this case € 12.5 thousand. Ideally, subsistence rises with income, and not just real net average incomes. This ideal implies that exemption is adjusted not just for inflation, but for the nominal growth of income. Let us assume this ideal, and let us assume that national nominal growth is 4%, for example consisting of 2% inflation and 2% real growth. Let us then regard the situation of a single economic agent. He knows that next year exemption will be adjusted with 4%. He has to judge whether it is worthwhile to him to invest or to increase labour effort, so that his income will rise. If his personal income rises with 4%, then his dynamic marginal will be equal to his present average tax rate. If his personal income rises by 8%, then his dynamic marginal will differ; it will depend upon his actual income level, but anyway will be less than the statutory marginal rate of 50%. Figure 15 gives the plot of the dynamic marginal for those two rates, for various levels of income. The 4% line here also gives the average tax level.
Figure 15: The dynamic marginal rate
Individual
income grows at 4% or 8%, while national income grows at 4%
and the statutory marginal rate is 50%

Empirical analysis often shows marginal rates to be less relevant - and average tax rates to be more important - than ‘common theory’ claims. This analysis on the dynamic marginal provides a useful part of the explanation.
Above analysis concerns minimum wage unemployment. The next question is how this relates to other kinds of unemployment.
It is useful to observe that the analysis in these pages is new. Concepts like the tax void, differential indexation and dynamic marginal tax rates, and the insights on their interaction, are really new, and have been concocted by me in a search for new scientific results. That means that governments have not incorporated these concepts in their policy making (even though the occasional civil servant may have been aware of some phenomena). Policy making up to now has been based upon a different analysis, and, alas, by being different from the right analysis, the governmental analysis is a wrong one. This is not without consequence. By analogy, when a patient gets a medicine based on a wrong diagnosis then the illness may get worse rather than diminish. In the present case, the tax void unemployment has important spillover or domino effects on unemployment above the minimum wage, and the channel of transmission is the misguided policy reaction up to now.
For example, in the 1970s governments tried to stimulate the economy by incurring big deficits, but they ended up with inflation. In the 1980s and 1990s governments opt for low inflation, and they end up with high real rates of interests and mass unemployment in Europe and poverty in the United States.
For example, Dutch economic policy is based on a general restraint on wages. This policy has fueled Dutch exports and reduced Dutch imports. The general restraint in fact subsidises exports, and Holland runs an external surplus for quite some years now. The internal imbalance is reflected in an external imbalance. The proper policy reaction however would be a wage cost policy targetted at the minimum.
Please note that the present review only gives a diagnosis, and that it is a different affair to find the proper therapy. The first is necessary step before the second can be considered.
In the course of some years I have experienced that discussing therapy is useless when people do not even understand the diagnosis. Policy makers tend to be focussed on therapy - but judge this from a wrong diagnosis. For example, in The Hague in 1992 (at a social-democratic political rally when I was no longer a member of his party) mr. Wim Kok, the Dutch Prime Minister of 2000, occasional chairman of the European Union and the social-democratic ‘respected elder’ to mr.-s Clinton, Blair, Schröder and Jospin, and a person who did some basic econometrics in his younger years, laughed loudly when I suggested to raise Dutch tax exemption from the then € 3 thousand to € 10 thousand. He must have thought of staggering costs, and it didn’t help when I said that it need not cost anything.
A major remark about therapy is that to undo the damage of the last four decades, it is not necessary to take four new decades. Return to optimality can be much faster.
The alternative and new policy would be to abolish taxes in the tax void and to allow people to earn their own - decent and untaxed - living. This alternative policy reminds of an old rule. The Dutch economist Cohen Stuart proposed in 1889 (cited in Hofstra (1975)) to put tax exemption at the level of subsistence. To drive the point home he drafted the following analogy:
“A bridge must carry its own weight before it can carry a load.”
In 2005 there is the additional argument that abolishing void taxes will not cost anything, while nations will save benefit payments due to more employment.
Note that the ideas of Cohen Stuart’s ‘bridge’ and the tax void are not very complex in themselves. In 1991 I explained them to a 12 year old kid and he commented: “A child can understand that.” Still, the EU and its score of modern governments sin against these concepts.
If unemployment is inefficient, then by definition there is a Pareto optimising solution, that will not cost anything. Most economists don’t believe in cheap solutions. Much of the debate hence focusses on ‘efficient unemployment’, where the sad state is caused for example by globalisation, technology or ‘welfare state scelerosis’ (with poverty traps). But, clearly, the tax void exists, it is a cheap way out, and the other arguments will turn out to be ghosts, which they already can be shown to be.
Note though that some period of transition may be required. Policy makers will be hesitant, advisedly, about an overhaul of the tax system. Note, then, that the tax system defines our notion of a subsidy. A wrongly levied tax, in this case the tax void, can be compensated for by a wage cost subsidy. [50] Abolishing the tax void is more sensible in the long run, but since this can only be done gradually, then some general subsidy directed at lowly productive jobs would speed up short term adjustment. The rule would be that those subsidies are reduced when tax exemption rises towards subsistence.
More employment.... Does that not fuel inflation ? The pieces of the puzzle fall into their places when the tax void is related to the unemployment & inflation problem. The steady rise of the tax void explains the track record of unemployment and inflation. The 1950s have been characterized by relatively low taxes on low income earners, and this allowed for full employment and low inflation. From the 1960s onwards the lagging tax exemption started causing problems with unemployment. The tax policy since at least 1965 enhanced the imbalance of the internal bargaining positions of labour instead of counter-balancing it. Hence inflation was persistent, and high levels of unemployment were required to achieve price stability.
As said, governments suffer from a co-ordination problem. How governments reacted in the past depended upon the view of the day. Since the proper solution was not known, the problem did not go away. The differential indexation of tax exemption and the social minimum did not draw attention to itself. Each year adds only a slight effect which is hard to see. But over the years the void has accumulated, and with huge consequences. And the problem will remain with us in the future unless policy changes.
The co-ordination problem persists, currently. Governments currently regard minimum wage unemployment as just one type of unemployment, and not even the most important type. Current policy is based upon other explanations for unemployment, notably those of technology, globalisation and flexibility. The policy reaction based on these views is to reduce taxes for higher incomes, so that they are encouraged to work, invest and spend more, and so that labour market flexibility might be increased. However, the ineffectiveness of current policy can be explained by the fact that these views are not entirely logical. The arguments of technology, globalisation and flexibility run up against contradictions:
· Technology is a source of wealth, and it boosts the productivity of the lowly productive jobs, making the problem of poverty and unemployment less serious than it would otherwise have been.
· “Globalisation” is a scare word for “trade”. Trade however is another source of wealth, and it too has been with us for ages. Rising wealth in distant countries means rising wages over there, and trade itself thus puts limits to foreign competition. Japan over the last 60 years is a prime example of this phenomenon, but every rich nation has had the same experience.
· The “flexibility” or “welfare state sclerosis” argument can only explain that the US has poverty and Europe unemployment, but it does not explain that there is a problem with low productivity jobs in the first place. The poverty trap as said does not exist.
Thus to be sure: the real policy target is low inflation, and policy makers only discuss technology, globalisation and sclerosis/flexibility in a second line of the argument. This second line is essentially a cop-out, since it does not concern the real issue - and a discussion can be very tiring if people behave like that.
At the same time, the wrong policies work counterproductively. The reduction of taxes for the higher incomes obviously is financed by a reduction of provisions for the lower incomes, aggravating the minimum wage and poverty problems.
In my analysis, the present situation bears another surprise. We diagnose current unemployment as inefficient. Be sure that you see what inefficiency means: it means that there is a solution that is beneficial to some and that does not hurt others. Having a bright idea always means a “win-win” situation or a free lunch. In the present case there is the move to full employment under price stability. The present unemployed will find jobs. The higher productivity group will have a theoretically larger risk of unemployment, but in practice this risk will be modest as in the 1950s. The real gain for the higher income earners will come from the services that will be provided by the jobs of the presently unemployed. So you do not need to reduce taxes for the higher paid, since they already will have a real gain at current income.
This was it, in a nutshell. Now I beg your understanding. My analysis is more complex than can be stated in these few lines. Both tax policy and social policy are quite complex themselves, and this certainly holds for their interaction with inflation and unemployment. For example, you may ask why I haven’t discussed income redistribution effects. Actually, this is because the alternative policy could be neutral to the income distribution. The reason for this is that the analysis focusses only on the link between wage costs and productivity. But you might want to hear more about this. Also, you might ask whether above explanation covers all possible cases of unemployment and inflation. Of course it doesn’t. The analysis does help to clarify that other types of unemployment need other types of policy, such as education and so on. But you might want to hear more on that too. These are just examples of issues, and there are many more issues that need to be dealt with. Which space forbids. However, given that my model amends existing economic models, much of the required explaining is ‘existing economics’.
This novel explanation is in the tradition of Keynes and Tinbergen while it fits in with mainstream economics. When economists check and confirm these findings, our economies are likely to enjoy more growth with full employment and low inflation.
While the above uses a stylized example of Holland, there is a short and enlightening story about actual Dutch politics, far remote from econometric regressions. Quotes are here in my translation, Dutch readers can also read Colignatus (1994b:28).
In Dutch politics, parties have to form coalitions to be able to govern, and the Biesheuvel 1971 cabinet came about by a coalition agreement that contained the following plan:
“Increase of tax exemption (in the direction of equality exemption for married couples with one child towards the minimum wage (….))”
The explanation of this idea to parliament was (MvT 1971/72):
“(…) it doesn’t require more adstruction that current exemption is too low. Its size doesn’t satisfy the fundamental notion of a threshold, the exemption of taxation of part of income, that is reasonably required for financing the necessary means of existence as seen in contemporary social views.”
This plan didn’t succeed, the government broke down prematurely. There came about a new leftist government under leadership of Den Uyl, and his Minister of Finance was Wim Duisenberg, the president of the European Central Bank in 2000. This cabinet however rejected above concept. The 1974 argument was:
“De government (…) explained that the social minimum had been raised in the preceding years in such extent that it could be considered to provide means to pay taxes.”
The latter statement is rather shocking. Subsistence is by definition a net concept, and the politicians don’t stick to that definition. The statement also means that someone who falls in the tax void is forced into a benefit situation. [51]
What is alarming too, is that Duisenberg was not alarmed, didn’t veto this nonsense.
After this ‘Duisenberg disaster’, the issue disappeared from people’s mind, it got transformed into an annual debate on indexation and the topic of discussion became the level of benefits for the needy. In 2005 Holland still suffers the consequences.
In March / April 1996 I put two
presentations for the general public in the Economics Working Papers archive at
the Washington University at St. Louis. In August 1998 there was a third paper.
[52]
These papers are directed to a general audience, and to teachers and students.
Since this current book basically addresses economists and uses quantitative
methods, I doubted whether I should include these texts here, also since there
is some overlap that can be distracting. There however are two good arguments
to include them with little adaptation: (i) Once a fellow economist is starting
to grow convinced of the value of my analysis, then he or she will face the
same problem of explaining it to others. These texts then can be of use. (ii)
The historical date of these texts underlines the co-ordination problem. Even
when a good summary was available, and even when the moral imperative facing
Western nations was clearly formulated, our failing systems of economic policy
making limped along, and caused misery upon misery for many of its citizens.
A breakthrough in economic theory
Since the early 1970s Western economies have been plagued by mass unemployment and the threat of inflation. Over the years since then various economists have proposed various possible solutions, but never quite convincing ones. Now there is a novel analysis that means a breakthrough in economic theory. The present author is quite certain that the “missing link in the model” has been found. If true, this analysis offers guidelines for full employment under price stability, just as Western economies enjoyed in the 1950s. The main point is: don’t tax lowly productive labour. Why ? To keep it competitive so that more productive labour will not demand inflationary pay rises. Though this new analysis is only in the stage of presentation and introduction at the scientific fora, there is no reason to withhold the present rough sketch for a general public.
It is well-recognised these years that Western economies have a problem with jobs with a low level of productivity and thus a low level of market-earned income. The United States tolerate more poverty - the working poor - while Europe sets its minimum wage much higher so that Europa has more unemployment.
This problem with low productivity jobs finds various explanations, notably those of technology, globalisation, and inflexibility - the latter ornate for “welfare state sclerosis”. Policies based on these latter explanations have been enacted for some time now. For quite some time, in fact; while little is being achieved. It is proper that we pose the question: why is it that we don’t achieve much ?
Unemployment obviously has a much longer history than the current problem. Also, the Western track record on unemployment can only be understood when the record on inflation is taken into account too. Economic science has much to say on the complex relationship between inflation and unemployment. Now, we are forced to be brief here. We will concentrate on what is new and on why it is new.
We set out with the empirical evidence since 1950. This track record can be divided in meaningful decades:
· The 1950s had low unemployment and low inflation.
· The 1960s had the threat of unemployment, and governments accommodating inflation in order to actually prevent it.
· The 1970s nevertheless had mass unemployment bursting into the open, and governments accommodating high and accelerating inflation to battle it.
· The 1980s-till-now had governments come down hard on inflation, and accepting high levels of unemployment as the price for stability.
One sees a certain “trade-off” between unemployment and inflation. From the 1950s till the end of the 1980s the common view among economists and policy makers was that the unemployment in the trade-off was “general” unemployment. Nowadays we tend to link unemployment to lowly productive labour. For us it may be obvious, but compared to the earlier view it is revolutionary that the once-thought-to-be “general” unemployment now turns up as a rather specific type. To make the revolution specific: we will hold that the unemployment in the trade-off has always been related to the distribution of productivity across labour.
The crucial insight is that the people who can demand pay rises need not be the people who run the risk of unemployment thereof. High productivity workers run less risk of unemployment and can more easily demand pay rises, while low productivity workers run the larger risk of unemployment. High productivity workers are more versatile and are able to shift the risk of unemployment to the lower income groups. When jobs are scarce, the high productivity workers even crowd out others from the labour market.
Now obviously, when this is new, then it has not been recognised before, and then it has likely been missing in policy. And policy that was based on a wrong analysis, is likely to have been the cause of the very problem that it wanted to solve.
Let us see how it went wrong. Regard the legal minimum wage and note that people are not allowed to work below that minimum. Note too that there hence will be no earnings that can be taxed in that range. We can call this range the “tax void” or “tax vacuum”. However, tax statutes are defined in that range anyhow. Tax statutes in that void are actually used to define the gross minimum wage. In Europe, the high gross wage will cause unemployment and its related benefit burden. In the US, the void is reduced a bit by accepting poverty. In common economic terms: tax policy and social-economic policy are badly co-ordinated.
How this has come about is a story of a more technical nature. First note that OECD countries adjust their taxes for inflation. Tax exemption in 1996 will often be close to the inflation-adjusted real value of 1950. On the other hand, research in social psychology shows that subsistence tends to rise with the general level of income, the growth of which consists of inflation and real growth. So there is “differential indexation”. In the 1950s exemption was pretty close to subsistence, so that there was no void to speak of. Since then, exemption has lagged behind the standard of living. The inflation-adjusted subsistence of 1950 may be only a third of 1996 subsistence. When tax exemption lags behind net subsistence, then there is a multiplier effect on gross subsistence, with a fast increase of the tax void.
The alternative and new policy would be to scratch taxes in that void and to allow people to earn their own - decent and untaxed - living. This alternative policy reminds of an old rule. The Dutch economist Cohen Stuart proposed in 1889 to put tax exemption at the level of subsistence. To drive the point home he drafted the following analogy: “A bridge must carry its own weight before it can carry a load.” In 1996 there is the additional argument that abolishing void taxes will not cost anything, and that nations will save benefit payments due to more employment.
More employment.... Does that not fuel inflation ? The pieces of the puzzle fall into their places when the tax void is related to the unemployment & inflation problem. The steady rise of the void explains the track record of unemployment and inflation. The 1950s have been characterized by relatively low taxes on low income earners, and this allowed for full employment and low inflation. From the 1960s onwards the lagging tax exemption started causing problems with unemployment. The tax policy since at least 1965 enhanced the imbalance of the internal bargaining positions of labour instead of counter-balancing it. Hence inflation was persistent, and high levels of unemployment were required to achieve price stability.
How governments reacted depended upon the view of the day. Since the proper solution was not known, the problem did not go away. The differential indexation of tax exemption and the social minimum did not draw attention to itself. Each year adds only a slight gap which is hard to see. But over the years the gap has accumulated, and with huge consequences. And the problem will remain with us in the future unless policy changes.
Current policy is based upon other explanations. Notably those of technology, globalisation and flexibility. The ineffectiveness of current policy can be explained by the fact that these views are not entirely logical. The arguments of technology, globalisation and flexibility run up against contradictions. Technology is a source of wealth, and it boosts the productivity of the lowly productive jobs, making the problem of poverty and unemployment less serious than it would otherwise have been. “Globalisation” is a scare word for “trade”. Trade however is another source of wealth, and it too has been with us for ages. Rising wealth in distant countries means rising wages over there, and trade itself thus puts limits to foreign competition. Japan over the last 40 years is a prime example of this phenomenon, but every rich nation has had the same experience. Finally the “flexibility” or “welfare state sclerosis” argument can only explain that the US has poverty and Europe unemployment, but it does not explain that there is a problem with low productivity jobs in the first place.
The present situation bears another surprise. We diagnose current unemployment as inefficient. Be sure that you see what inefficiency means: it means that there is a solution that is beneficial to some and that does not hurt others. Having a bright idea always means a “win-win” situation or a free lunch. In this case it is the move to full employment under price stability. The present unemployed will find jobs. The higher productivity group will have a theoretically larger risk of unemployment, but in practice this risk will be modest as in the 1950s. Their real gain will come from the services that will be provided by the jobs of the present unemployed.
Policy makers will be hesitant about an overhaul of the tax system. Note, then, that the tax system defines our notion of a subsidy. A wrongly levied tax, in this case the tax void, can be compensated for by a wage cost subsidy. Abolishing the tax void is more sensible in the long run, but when this can only be done gradually, then some general subsidy directed at lowly productive jobs would speed up short term adjustment. If only those subsidies are reduced when tax exemption rises towards subsistence.
This was it, in a nutshell. Now I beg your understanding. My analysis is more complex than can be stated in these few lines. Both tax policy and social policy are quite complex themselves, and this certainly holds for their interaction with inflation and unemployment. For example, you may ask why I haven’t discussed income redistribution effects. Actually, this is because the alternative policy could be neutral to the income distribution. The reason for this is that the analysis focusses only on the link between wage costs and productivity. But you might want to hear more about this. Also, you might ask whether above explanation covers all possible cases of unemployment and inflation. Of course it doesn’t. The analysis does help to clarify that other types of unemployment need other types of policy, such as education and so on. But you might want to hear more on that too. These are just examples of issues, and there are many more issues that need to be dealt with. Which space forbids. However, given that my model amends existing economic models, much of the required explaining is ‘common economics’.
There remains one major point. That tax exemption is low, is defended by OECD governments with the argument that it keeps marginal rates down. And the attractiveness of low marginal rates is that they spur economic activity. My finding however is that the latter claim is only true when the marginal rate has been defined properly. Thus I agree with the claim, but it must concern the proper marginal tax rate. There is a difference between the proper rate, which is dynamic, and the rate used by OECD governments, which is the static and statutory rate. Dynamic analysis shows that the proper marginal rate will be close to the average rate. This part of my analysis is important for economic growth. Having less unemployment will mean lower average taxes, and thus lower proper marginal rates, and thus more incentives for sustainable growth. For many of my fellow economists it is this part of my analysis that will come as the greatest surprise of all. However, this is not an issue that can be settled in this review, and here I definitively have to refer to my extensive analysis.
This novel explanation is in the tradition of Keynes and Tinbergen while it fits in with mainstream economics. When my fellow economist check and confirm these findings, our economies are likely to enter into a new high growth path with full employment and low inflation.
Allow me to add the personal note that I am overjoyed by these findings.
(March 1996)
World developments in the 1990s show a worrysome parallel to the 1930s with the Great Depression. Present-day Russia reminds of the pre-war Weimar republic, where a devastated economy and weak democracy allowed Hitler to take power. Western nations in the 1990s hinder trade with Russia and the Eastern nations for fear of unemployment at home, as they did in the 1930s with Germany. If trade were stimulated instead of hindered, Russia could regain economic and political stability by itself. The moral problem is not external and does not concern whether Russia would need financial aid. The moral problem is internal, and concerns whether Western political leaders are willing to face their own errors that cause the present mass unemployment at home.
Russia is shrouded in a veil of doom. A nation once proud about its achievements, is now, as so many feel, humiliated in the face of history. A loss of empire, a collapse of economic security, some coup attempts in both Kremlin and Duma, a rising reign of violence by a mafia in the main cities and by full-blown fighting at the geographical fringes, and a political arena that smells more of fear than of confidence. Like the Weimar republic in pre-war Germany, Russia has been subjected to the rules of chaos, and yet again the odds are risky - and risky for the world at large.
Something needs to be done. Something smart, something humane, something effective and efficient, and something courageous. Therefor, something which is not likely to happen quickly. However, there is one single possibility that is very much worth of our attention. It is something what we actually could do. And what - given the risks of this moment - we should do
It is trade that will help Russia and the Eastern nations to recapture economic security and thereby regain political stability. And, since it is our fear of unemployment that motivates us to block that trade, Western nations should tackle unemployment at home directly.
Our comparison of present-day Russia with pre-war Germany is no coincidence. World developments in the 1990s show a worrysome parallel to the 1930s. The 1930s suffered from the Great Depression. In the 1990s the world is again plagued by mass unemployment. Again there is a major region that is economically devastated and that desperately needs access to the world market, and yet again the other wealthier nations hinder that entry, while concentrating shortsightedly on their own problems at home, and neglecting the consequences of neglect. The West might want to reduce the risk of a Russian disaster, but not at the cost of jobs at home. Trade barriers are there to keep cheap Eastern products from “flooding” its home market. Europe throws in huge subsidies for its agricultural exports. Western tariffs or quality requirements are pitted against Eastern exchange rates, in a war on trade whatever its consequences on economic and political stability.
The West is dugging in and seems to repress the recognition that history is repeating itself. Again the world finds itself in a deadlock, and yet again chaos feeds on it.
But we should remember the trade war of the 1930s and the rise to power of Adolf Hitler ! In the 1930s the same mechanism of trade, unemployment and political instability applied. In this period it was Germany that was the weak nation. The Versailles Treaty of 1919 that ended World War I put Germany under a huge reparations bill. The world forgot that the war had been started by an autocratic Kaiser and that Germany now had a new, fidging democracy. To pay that bill, this weak democracy was obliged to cut imports and to spur exports. The reparations bill worked like a foreign tariff that took away funds that could have been invested otherwise. By the end of the 1920s Germany defaulted on its international debt - and thereby indirectly caused the Wall Street Crash of 1929. Thereafter, all nations scrambled for the life-boats. Nations feared for their home markets and employment, and defended themselves by exchange rates and tariffs. In their fear they made things only worse. The German economy collapsed, and on the teutonic waves of resentment its weak democracy toppled and Hitler took power.
Let us now compare: Is the Russian democracy anything other than new and fidging ? Have its generals not tried to seize power ? Have its tanks not roared against its very own Parliament building ? Has its economy not dropped by a third? Or conversely, have all its nuclear weapons and uranium stores been savely secured ? Have the Western nations done their utmost in opening their markets ?
Of course, there is a glimmer of hope. The Russian capacity for suffering is impressive. Few nations could sustain this suffering and national disgrace without lapsing into resentment, cruelty and violence on a much larger scale than we actually see in Russia. The West has provided some funds and done something more. The world is not at war and may not be at war for some time. The probability that things go right is large, and there is only a small chance that things go wrong.
But please consider: If the only glimmer of hope is that the world is not at war, then the situation is quite depressing. Hope is not the point, and neither likelihood nor expectation. The point is risk. Risk comes from the arithmetic of loss multiplied by chance. Thus: risk = loss * chance. If things go wrong in Russia then the consequences will be huge, and a small chance times a huge loss gives a risk too large.
The West should open its eyes and see the economic logic. Eastern nations need to take part in the international economy and thus need modern Western equipment. To buy the latter goods they need the proper currency. Either someone gives them that foreign currency, the dollars, yen or marks, or they have to earn it themselves by exporting. To simply give them credit, on the scale required, is absurd. Therefor it is access to Western markets that is essential for those nations and for political stability. Indeed, if they had access, and if the flow of trade were to start, then the World Bank and IMF could extend credits and thereby fuel the process towards stability.
At the same time, economic science tells us that it is not trade that has caused present Western unemployment. Marking trade down as the culprit, and using trade barriers to solve a situation that trade has not caused, only makes things worse.
The moral problem is internal and not external. The cause of present-day unemployment in Western economies is internal management and not external trade. There is a failure within the internal co-ordination of macro-economic policy, a failure by our very own governments. Western nations could tackle their unemployment problem at home - if only our political leaders were willing to take a hard look at their own internal policies.
The historic parallel also concerns the current lack of attention for the internal question. Policy makers that concentrate on an external trade war neglect the internal opportunities. There is the following sobering story about the economist John Maynard Keynes. From the early 1930s Keynes advanced his solutions to the Great Depression, and this culminated in his 1936 book that changed macro-economics. Policymakers could have reacted already in the early 1930s, ... but only did so after World War II had already begun.
We might ask: Do we care about the peoples of Russia and the Eastern nations ? And should we act with economic sense ? However, those questions are imprecise. The real question is whether our leaders care so much that they will reschedule their busy agenda’s and really look into a problem that they cause themselves.
There is every reason to believe that political leaders are quite deaf on this. So pray that there will not be a new world war. So shout to your political leaders: Stop that trade war !
Do something about external trade tariffs and internal unemployment. Enable Russia to help itself.
(March 1996)
Asia and the Eastern European nations are in a state of economic turmoil. An important element for improvement is that Western nations open their markets to more trade. This is in fact what the West could have done after the fall of the Berlin Wall. But petty shortsightedness of the governing elites in the West blocks this kind of solution. The situation reminds one of the Versailles peace conference after World War I that fostered a lot of resentment and helped cause World War II. The basis conclusion is that sound economic advice is not listened to. The best advice on how to steer out of the current world macro-economic mess is that every parliament installs a committee to enquire into the process of economic advice. They could study the books by Paul Krugman, and possibly also my analysis on unemployment and my suggestion for an Economic Supreme Court.
Western nations show an inadequate reaction towards the Eastern nations since the fall of the Berlin Wall, and this inadequate reaction is repeated with respect to the current economic throes of Asia. The West displays disinterest in the hardship and actual physical pain inflicted on millions of our fellow human beings, and a neglect of the long run effects of this egotistic behaviour. Part of this inadequate reaction however is also caused by wrong applications of economic theory, so that true compassion that is out there doesn’t get the chance to show itself. One lesson is that Western nations are advised to restructure their policy making process so that governments are better served with proper economic advice.
The negligent way that the Western nations treat the other nations reminds one of the Versailles peace conference after World War I. Historians agree about the sad Western attitude at the Versailles conference. The Western Allies humilated Germany and subjected that country to decennia of economic hardship, purposely crippling its economy. These events caused a huge resentment in Germany, and this fostered the rise of Adolf Hitler. Also, Germany’s defaults on its financial obligations were a major cause for the 1929 Crash and the subsequent Great Depression. This episode is another example that two wrongs don’t necessarily make a right, and it also shows how wrongs can backlash at the wrong-do-er.
The lesson of Versailles is that opponents can often best be allowed to grow into a relationship of companionship and economic competition and co-operation for the betterment of all. Rather than subdue them or take advantage of temporary weaknesses, they could be helped so that they could help us. This lesson should now be applied to the current situations of Asia and Russia.
It is useful to recall that Western nations were not without proper advice at the time of Versailles. They were warned, and by nobody less than J.M. Keynes. As Paul Krugman recently stated about Keynes: “After that war he became famous as the author of The Economic Consequences of the Peace, an eloquent condemnation of the vindictive terms imposed on the defeated Germans; his concern was vindicated by the rise of Adolf Hitler, and the memory of his warnings helped convince a victorious America to aid, not punish, its prostrate enemies after World War II.”
Indeed, after World War II the Allies helped Germany and Japan to reorganise their countries and to prosper again. While the average citizen may be deluded by sentiments of nationalism, religion or ideology, it normally is a governing elite that abuses those sentiments for purposes of its own grandeur - and once a decent government is in place, there often appears little reason to blame that average citizen for the errors of its country. In the same way post-communist Russia deserves our sympathy, and the same holds for Asia with its different history.
But why has the West forgotten this valuable lesson ? Why do Western governments neglect Nobel Prize winner Jan Tinbergen’s work on the Optimal Economic Order, and why do we again have a show of petty egotism and shortsightedness ?
The reason is that the West is not immune to the same ‘governing elite’ processes that can be at the detriment of common welfare. The governing elites and bureacracies in the West have agenda’s of their own, and though they are restrained by democratic rules, these rules are not as strong as they could be. Our systems of checks and balances are a product of history, and not necessarily of the quality required. Politicians and bureaucrats often still can lie and get away with it. The United States e.g. had David Stockman on the budget deficit, and it took too long before that matter was settled. In general, sound economic advice still is obstructed by political processes, and policies and the electorate itself then grow misguided in their choices.
To better understand the failure of Western democracies on the issue of economic advice, one can best start by reading Paul Krugman’s books “The Age of Diminished Expectations” (1990), “Peddling prosperity” (1994), “Pop Internationalism” (1996), and “The accidental theorist” (1997). For example, when Krugman discusses US majority leader Armey’s book “The Freedom Revolution”, he states: “Armey is no fool. He cannot be unaware that he is fudging his numbers. Possibly he regards a small fib as justifiable in the service of a higher truth. Or possibly he has managed to achieve a state of doublethink, in which the distinction between what is politically convenient to believe and the objective facts no longer exists [sic]. The end result is the same: His book is an effort to obscure the stark realities (…)” (1997:60). Similarly, one can read in the American Economic Review that the US Council of Economic Advisers is rather proud of its achievements in the last decades, but we should be aware that this council is a bureaucratic body, and it hasn’t the independent position that could have protected the US economy from the events and errors as are related by Krugman in his “Peddling prosperty” saga or shown by the record of mass unemployment.
Let us now regard what the West could have done with regards to Russia after the fall of the Berlin Wall and the first free elections there - and what could be done now also with respect to Asia. I take my own 1996 paper “Enable Russia to help itself”, and quote from its summary: “Western nations in the 1990s hinder trade with Russia and the Eastern nations for fear of unemployment at home, as they did in the 1930s with Germany. If trade were stimulated instead of hindered, Russia could regain economic and political stability by itself. The moral problem is not external and does not concern whether Russia would need financial aid. The moral problem is internal, and concerns whether Western political leaders are willing to face their own errors that cause the present mass unemployment at home.”
Clearly, with this being the state of affairs, one can imagine the strength of the forces that prevent a proper discussion of these issues. Western companies embrace tariff barriers to cheap imports - and raise their own prices. Bureaucrats embrace barriers since these give a sense of control, and these also justify the very existence of this bureaucracy. Labour unions will fight unemployment at home with whatever misguided argument it takes. Governments embrace economic tales about ‘globalisation’ and ‘competition from cheap labour countries’ since these distract attention from home grown errors, and these goverments neglect economists who tell them that ‘globalisation’ and ‘competition from cheap labour countries’ are rather like fairy tales indeed. Krugman again uses the term ‘globaloney’ - and have you heard your President or Prime Minister adopting that critical attitude too ?
The best economic advice for the current situation is as follows - and I urge upon my fellow economists to adopt and spread that advice too: Every parliament could install a committee that will enquire into the process of economic advice. This committee could study Krugman’s books and my suggestions for a solution of mass unemployment and for an Economic Surpreme Court amendment to the national constitution(s). Nothing less will do. Note, by the way, that when countries start installing these committees, the markets will be quick to anticipate the directions of their conclusions, and economic recovery would already set in.
We all know Lincoln’s words: “You can fool all of the people some of the time, and you can fool some of the people all of the time, but you cannot fool all of the people all of the time.” Let us act upon it, or show Lincoln wrong. (August 1998)
Notes in 1999: (1) A 1999 UNDP report describes the Eastern European situation as disastrous, and calls for a quick joining up to the EU (De Volkskrant October 16 1999). It is courageous that an international body speaks up like this - and it indicates the seriousness of the situation. (2) The journalist Peter Michielsen in NRC-Handelsblad October 30 1999 rightly calls attention to the original borders between the empires of Rome and Byzantium. The Eastern European countries that are doing relatively well belong to the Roman area, the others to Byzantium. He mentions that this cultural distinction has also been noted by Andreas Oplatka of the Neue Zürcher Zeitung 1994, who again refers to George Kennan in 1945. I was a bit surprised by this, hadn’t thought about it in this way. (3) These points however nicely fit what I have been argueing for ten years now. Enabling people to help themselves starts with taking account of the local conditions; and overall the barriers to trade should go.
At the Dutch Central Planning Bureau, I helped making the Athena model (CPB (1990)) with its 7000 variables. I had this model at my computer and could let it do tricks like an obedient dog. But a proposal to an exercise effectively like the above was rejected by the directorate, and nowadays I am no longer in the position to make such proposals. The desktop computer that I have now, in 2004, might have more power than the 1990 mainframe, but I don’t have the data, the programs, and the possibility of discussion with colleagues. I have Word for Windows, Mathematica, some crucial books, an occasional visit to the Dutch Royal Library, and the internet (at low speed). Moreover, I have to make a living, in a different kind of job, and my time constraints thus are severe. This explains why I am forced to a logical argument - and this explains again why I emphasise logic anyhow.
Thus, crucially: it is up to the fellow economists to check my findings. They / you should actually do this anyhow, since a critical perspective always is best. For example: What are the data on the minimum wages in the other OECD countries ? OK, the OECD internet site shows that 1997 statutory minimum wage is 39% of median wages incl. overtime in the USA, 60% in France, 30% in Japan, etcetera, quite sizable [53] - but what about the tax void, the development, the indexation, the discouraged workers below the minimum, etcetera ? [54] What about the shifts of the Phillipscurves in this light ? What about the effects of the dynamic marginal rate ? How are these topics in all nations ? And what would happen, if all nations gain confidence about growth policies again, and they fire up each other and move all to a new higher growth path ? Clearly, the research agenda is huge.
The situation since 1989-1991 has been a bit like this: Me stating that unemployment has been solved (analytically) and inviting the fellow colleagues to check it - and nothing further happening. This book should make a difference in that I collect the various articles that I have been able to write since then. When others see the whole route then they will also better see the crucial junction where to take the other turn.
This may also concern the novel contribution to methodology below. [55]
Methodology may be seen as ‘economics applied to science’. The methodology of economics is the fixed point in that construct - even economic methodology in the traditional form as presented by Tintner (1968).
The ‘basic economic problem in science’ is - in my perception or definition - that some set of concepts can better deal with the data than another set. New ideas are like manna from the sky, but the manna must be collected, stored, compared to the older findings, etcetera, and an optimum must be found, using scarce resources over alternative ends. This ‘basic economic problem in science’ thus is quite different from the ‘mundane (non-basic) economics’ that, say, 5% more truth can be traded against 10% more effort and cost.
The mind has the economic problem of dealing effectively and efficiently with (i) old concepts, (ii) new information and (iii) the construction of new concepts. The name of the game is to have concepts or definitions fit reality as usefully as possible. The definitions must be chosen as strong as possible, so that uncertainty can be shifted to observation (and the problems with observation).
The human mind seems to be occupied with reduction of cognitive dissonance - or, at least, that is a fruitful way to look at that mind. Here I follow Aronson (1992a&b), who provides a definition of cognitive dissonance, and data and tests that lend empirical support for it. It appears that a commonly used method of reduction of cognitive dissonance consists of the rejection of new information to the advantage of older views. Frequently the messenger is blamed for the bad message, and even, after the messenger has been punished, the bad news is neglected since it came from an unreliable source - namely a person who had to be punished (while it is forgotten that, if the news is considered irrelevant, then there was no base for punishment). Man is a rather prejudiced creature, and thus not so effective and efficient at information handling - but man has to handle new information.

Barrow (1998:4) [56] provides us with a useful quote:
“This unifying inclination of ours is a by-product of an important aspect of our intelligence. Indeed, it is one of the defining characteristics of our level of self-reflective intelligence. It allows us to organize knowledge into categories: to know vast numbers of thing by knowing rules and laws which apply in an infinite number of circumstances. We do not need to remember what the sum of every possible pair of numbers is: we need know only the principle of addition. The ability to seek and find common factors behind superficially dissimilar things is a prerequisite for memory and for learning from experience (rather than merely by experience). (…)
All human experience is associated with some form of editing of the full account of reality (‘we cannot bear too much reality’). Our senses prune the amount of information on offer. Our eyes are sensitive to a very narrow range of frequencies of light, our ears to a particular domain of sound levels and frequencies. If we gathered every last quantum of information about the world that impinged upon our senses they would be overwhelmed. Scarce genetic resources would be lopsidedly concentrated in information-gatherers at the expense of organs which could exploit a smaller quantity of information in order to escape from predators or to prey on sources of food. Complete environmental information would be like having a one-to-one scale map. For a map to be useful it must encapsulate and summarize the most important aspects of the terrain: it must compress information into abbreviated forms. Brains must be able to perform these abbreviations. This also requires an environment that is simple enough and displays enough order, to make this encapsulation possible over some dimensions of time and space.
Our minds do not merely gather information; they edit it and seek particular types of correlation. They have become efficient at extracting patterns in collections of information. When a pattern is recognized it enables the whole picture to be replaced by a briefer summary form which can be retrieved when required. These inclinations are helpful to us and expand our mental powers. We can retrieve the partial picture at other times and in different circumstances, imagine variations to it, extrapolate it, or just forget it. Often, great scientific achievements will be examples of one extraordinary individual’s ability to reduce a complex mass of information to a single pattern. Nor does this inclination to abbreviate stop at the door of the laboratory. Beyond the scientific realm we might understand our penchant for religious and mystical explanations of experience as another application of this faculty for editing reality down fo a few single principles which make it seem under control. All this gives rise to dichotomies. Our greatest scientific achievements spring from the most insightful and elegant reductions of the superficial complexities of Nature to reveal their underlying simplicities, while our greatest blunders often arise from the oversimplification of aspects of reality that subsequently prove to be far more complex than we realized.”

This human property should be used in economics to explain actual events. Colignatus (1996d) for example applies Aronson’s findings in social psychology to economics, trying to indicate the actual ‘forces’. Another application is the very analysis in this book, for example where we stated earlier:
“If the government on the one hand would desire to use the results of scientific advice for its budget process, and on the other hand would not opt for an Economic Supreme Court, then its definitions would be logically inconsistent, and it would thereby tend to create a cause for dishonesty and improper manoeuvreing and thereby corrupt its processes.” (above)
While the above relies on structural models, the property
can also be modeled in the reduced form. Chapter 40 uses information indicator I
{0, 1}.
Another application is to the methodology of science. Methodology should harness this human property, and clarify when it is useful and when it is misleading.

Science aspires at a more unbiased approach. This unbiased approach also means the deliberate creation of cognitive dissonance, by creating new concepts and by looking hard at the evidence till it doesn’t go away anymore.
The evolution of knowledge can be described in terms of an ever increasing power in the concepts used.
The introduction of a new definition is not simple. The questions always are: does the definition cover the facts as we know them, does the definition not introduce hidden aspects that cause confusion and prevent advancement ? If a new definition wins out, it is, apparently, only so because it is believed to have passed the test. Though, we should be critical of this assumption. Only if the environment is ‘critical’, then we might presume a ‘survival of the fittest’ for concepts. (And all this is reminiscent of Dawkins’s ‘memes’.)
Definitions can be devious in quite vulgar ways. In the English economics literature, ‘perfect competition’ is defined as the situation when no agent can affect the price, i.e. all agents are price takers. The Dutch word for this case is ‘full competition’. The English definition forces English economists to use the word ‘imperfection’ for all other cases. Even quite reasonable cases, in the normal state of human life, when agents have market power but balance at some social optimum, would be ‘imperfect’. Also a natural monopoly would be an imperfection - even if one could not conceive the situation differently since the monopoly is a natural one. It would be better if the English economists would adopt the Dutch definition, so that the words ‘perfect’ and ‘imperfect’ could be used in their proper sense depending upon circumstance. This is just a vulgar example of how definitions can lead one astray.
The competition of alternative concepts can be quite sophisticated however. Let us illustrate this with three examples. The most illuminating example may well be Pythagoras’s theorem and its relation to the circle. This problem concerns mathematics, so that the discussion is less taxed by semantics and empirical matters - though there is of course the theory about empirical space. The second example of ‘falsification’ is surely in the realm of empirics. The third example concerns the distinction between determinism and volition.
Regard a triangle with perpendicular sides a and b and hypotenuse c. There are two points of view:
1. Pythagoras proved [57] that the square of the hypotenuse equals the sum of squares of the perpendicular sides, i.e. that a2 + b2 = c2
2. For the circle, it is taken as the defining quality of the circle, and thus accepted without proof, that the points are at equal distances from the origin. In other words, a circle with radius c is defined as the collection of points (a, b) at a distance of c from the center. Thus a2 + b2 = c2 by definition.
The two points of view are presented in Figure 16. The definition of the circle can be taken for granted, since it is just a definition. On the other hand, it will be very useful to discuss the proof of the Pythagoras theorem, since then we see the need for a proof.
Let us take the square with sides z = a + b and surface z * z = z2 = (a + b)2. Within this square we can see four triangles with straight sides a and b and hypotenuse c, as has been done in Figure 16 in the square on the left.
In the square, another tilted square has been drawn, with sides c and thus a surface of c2. There are four surrounding triangles, each triangle has a surface of ½ a*b. The surface of the large square is equal to the surface of the tilted square and the four triangles.
Figure 16: Pythagoras and the circle

Thus:
· From the big square itself: z2 = (a + b)2
· From the tilted square and the triangles: z2 = c2 + 4 ab/2.
Elimination of z then gives a2 + b2 = c2.
This proof has been taken from DeLong (1971), and he remarks that Pythagoras proved it differently.
How do we explain that one and the same equation can have two interpretations that are so widely different, one with the need for complicated proof and the other with direct acceptance by definition ?
There may be other explanations, but I think the following will do fine. Note that the definition of the circle relies on the notion of ‘distance’. There are two points of view again, so that point 2 above actually splits in two parts:
2A) Basically the (Euclidian) distance between two points can be measured by a straight line section. That is rather simple, and makes for a readily acceptable definition of a circle.
2B) However, in a system of
co-ordinates, that distance can be reinterpreted in a representation in terms
of the co-ordinates. There are two possibilities again. Either the distance can be defined as simply the formula dist[{x, y}, {a, b}]
((x - a)2 + (y
- b)2 ) with {x, y} the origin - above {x, y} =
{0, 0} - or it can be defined geometrically as the hypotenuse of the
differences of the co-ordinates. If either definition is accepted, then one can
use Pythagoras’s theorem to derive the other.
The essential difference between (2A) and (2B) is that (2A) is elementary and poor in concepts and results, while (2B) is complexer and rich in concepts and results. Viewpoint (2A) only allows us to use measuring rods between arbitrary points and little else. We are allowed to sweep the rod around the center, and thereby draw the circle, but then it somehow stops. Viewpoint (2B) allows us to do much more. A line between two points is interpreted in terms of a system of co-ordinates, and that opens the scope for new results.
We find that the opposition of (1) against (2) is rather messy, and (2) actually hides two suppositions. The ease of (2) depends directly upon the ease of (2A), while (1) actually compares with (2B) that is complexer. The phrase “In other words” in (2) above thus was misleading, and actually represents the introduction of another assumption.
With this clarified, we also note that (2) is stronger than (1), and that it was possible to seduce the human mind to accept (2) rather easily. There has been a progression in concepts, resulting in stronger definitions.
Note that behind all this there is a notion of empirical space. In (1) there is a hidden assumption of a flat space. In (2B) the assumption is made explicit, and then open to amendments (curved surfaces, or abstract spaces). The movement of (1) to (2) thus is, partly, (a) the advancement in concepts by means of the definition of distance (and the circle as a collection of equal distance points), (b) the introduction of the separate step of observation - with the difficulties: when does the definition apply to reality, or if there is some reality, how do I select the proper definition ?

The point that is relevant for this book then is: that the definition is so good, that it in practice substitutes for many everyday empirical problems. A criterion for a good definition is: that it can be such a substitute.
When a definition is a close substitute for reality, then it may percolate into common culture with more authority. For example: every citizen can establish the existence of a tax void and Pareto suboptiomal unemployment purely from the logic of the level of gross minimum wages and the official tax statutes - and we don’t need big computers or official bureaus to do some econometrics and then tell us.
Admittedly, there is danger in seductive and seemingly right but wrong definitions. If ‘child’ is defined as ‘irresponsible young human’, then we may be tempted to treat children as such and forget to expect the responsibility that they can handle. But the existence of this danger should not make us close our eyes to the advantages of good definitions.

A side issue concerns our concept of ‘space’. Let us first consider an example of cultural relativism. It appears that different human cultures can have different approaches to one’s orientation in space, and that these approaches are wired into the languages used. [58] Taking a point of reference can be done in three ways: (1) Relative: taking one-self (“the tree is to the left of the house” - seen by me); (2) Absolute: taking the sun (“the tree is to the west of the house”); (3) Intrinsic: taking one of the objects (“the tree is to the back of the house”). If someone is asked to copy a situation in front of him towards a place in the back of him, then there will be a different ‘copy’ depending upon one’s language/culture. If you have a cup of coffee and a pencil in front of you, pick them up, turn yourself around, and recreate the scene, then a Westerner will use relative positions, while an Australian Aboriginal will use absolute positions (and turn the relative positions around). The question now is: while this only concerns the point of reference, can we imagine something similar that affects our concept of space itself ?
I take the position that the human mind apparently is able to conceptualise Euclidean space - and that this actually defines our concept of space. If we take a non-Euclidean geometry - such as a globe - then this still can be imagined to exist within Euclidean space. Pythagoras’s theorem is invalid for triangles drawn on a globe, but to hold that space is a globe would be erroneous - since our definition of space would be Euclidean.
One of the questions often posed is whether the universe - interstellar space - is Euclidean or not. This is a badly posed question. If we define space as Euclidean, then it is another question whether a ray of light follows a straight line or is deflected by gravity.
Barrow (1998:p42-44) provides a troubling quote: [59]
“The most important consequence of the success of Euclidean geometry was that it was believed to describe how the world was. It was neither an approximation nor a human construct. It was part of the absolute truth about things. (…) This confidence was suddenly undermined. Mathematicians discovered that Euclid’s geometry of flat surfaces was not the one and only logically consistent geometry. (…) None had the status of absolute truth. Each was appropriate for describing measurements on a different type of surface, which may or may not exist in reality. With this, the philosophical status of Euclidean geometry was undermined. It could no longer be exhibited as an example of our grasp of absolute truth. (…) These discoveries revealed the difference between mathematics and science.”
This quote is troubling for the following reasons:
1. If we define ‘space’ as Euclidean, then it is an absolute truth. This definition seems to maximise our information power. Other surfaces can be imagined within that space.
2. One might think of ‘empirical space’ as something that must be measured. The idea is: ‘If it cannot be measured, then it is not relevant.’ OK, this seems fine in principle. But if a physicist would use ‘light’ as a measuring rod, then this is asking for problems. Namely, Euclidean geometry already provides us with our system of measurement. Defining ‘empirical space’ differently would conflict with our original definitional grasp of space. Better is: to stick to the definition, and regard measurements that deviate - e.g. from gravitational deflection - as the physical properties of the objects and measurement tools involved.
3. That
there is a difference between mathematics and science does not disqualify the
notion of absolute truth. A true deductive sequence ‘Assumption
Conclusion’ has absolute
truth. And it should be realised that scientific theories are mathematical
(with the scientist working on an assumption).
4. It is possible to translate the Dutch ‘lijn’ as ‘point’, and ‘punt’ as ‘line’ (thus conversely) and still find a consistent model for Euclid’s axioms. But this is a mathematical exercise, and it does not necessarily have to do with ‘space’.
So it seems that Barrow and I agree for 99%, but still, the 1% difference features big in some dimension. Note that the discussion here concerns more a side issue, but it remains useful to indicate the deeper aspects of Pythagoras’s theorem.
The ‘principle of falsification’ is that hypotheses are only scientific if they are formulated such that they are vulnerable to empirical testing, and might be falsified. It has been formulated by Popper, see Keuzenkamp (1994).
The principle has two disadvantages: (1) purely logical, (2) stochastically.
(ad 1) Take logic first.
Counterargument 1. Regard the statement All ravens are black. This statement will be false when one finds a non-black, say white, raven. So the statement would be an acceptable scientific hypothesis, since falsification is possible in principle. But, as the falsificationist would hold, it would remain a hypothesis, and we should be aware of the fact that is only a hypothesis, until it had been checked for all ravens (Tintner (1968:12)). This falsificationist view however is problematic, since most of us will sense that there is truth in All ravens are black, for example by our definition of a raven.
Counterargument 2. In the extreme, all scientific knowledge would consist of instances of falsification. It has been falsified that the Earth is flat, that atoms cannot be broken, that ... But the principle itself, i.e. that ‘all scientific knowledge would consist of instances of falsification’, is a definition and is not open to falsification.
While falsification may be a successful research strategy in many cases, it does not seem to be a fully satisfactory way of organising science, at least from these two points of logic.
(ad 2) Take stochastics next. Let us regard the typical modelling situation:
|
The model: Estimation: Observation X[+1] forecasts: Final observation: |
y = X ß + y = X b + e yest[+1] = X[+1] b + Exp[e[+1]] y[+1] |
The question now is whether this new observation can falsify the hypothesis of the empirical estimate. This question is not as simple as the naive falsificationist first had in mind. The principle of falsification is formulated as for deterministic reality, while many empirical models are stochastic. In stochastics, there may be deviations, and sometimes large ones. There are problems of measurement in y and X, the choice of the functional relationship, missing variables, and the choice of the stochastic specification itself.
One useful empirical answer is optimal control, with the example of a rocket launched to the moon, where there is continuous adjustment to observed error (‘falsification‘). This control only works well when there is a proper definition of the loss function. The issue of the loss function is a crucial one, but this is not falsificationism.
Logic and stochastics cause me to take the following position.
There is a difference between all1 (universal) and all2 (generally, usually, normally). The statement All ravens are black can be seen as:
1. a definition. It then holds universally. Empirical truth then is conditioned to the logical tautology of the definition that we have chosen. If we find a white bird that looks like a raven, it cannot be a raven. (But we think that this definition covers reality, for example since we have some ideas about genetics and evolution.)
2. an empirical statement - grounded in a stochastic model. It is shorthand for All ravenlike birds tend to be rather black or whatever the professional might deem correct. The meaning of such statements is more subject to context than in the case of well-groomed definitions.
The human mind thus faces the choice: To adopt a definition and run the risk that this does not fit reality so well, or to adopt a statement on averages and work out more details of the empirical loss function. Decisions on such statements thus are sensitive to the loss function, but the second category requires more detail.
This of course does not solve everything. The distinction of these two dimensions or perspectives is not like solving all problems in their domains. Also a definition like All ravens are black by definition does not answer the question whether a particular object is a raven or is black. Is a size of 10 kilometers acceptable ? Did we look in daytime or at night ? Must it be alive, and then, what is life ? So the distinction between definitions and empirical statements is useful, but it does not solve all problems. The point is not quite that one can always adjust definitions, but rather that a definition is not reality by itself. (Though it can get close.)
At one point in history, scientists were willing to accept the periodic system of elements to catalogue the wide variety of materials around us. There was apparently little loss involved in accepting these definitions, or Lavoisier’s periodic table was more gainful than other catalogs. The definitions did not change the materials, but facilitated more efficient research. At one point in history, see Mirowski (1989), economists were willing to analyse human behaviour in terms of utility maximisation. The approach is an empty box, since any behaviour can be described as such. For example satisficing behaviour can be represented as minimising the distance from satisfaction. Also in ‘evolutionary economics’ the utility maximisation model can be applied though these researchers are critical of this approach. (While, curiously, Charles Darwin was inspired, amongst others, by Adam Smith.) The new approach for laboratory experiments makes us even more critical about the rationality hypothesis. Utility maximisation however helps organising one’s thoughts, helps professional discussion, facilitates modelling and empirical estimation, and is generally considered an advance above less explicit approaches.
As with the Pythagoras example, but now empirically, there is a switch from just empirical knowledge to a set of definitions, when the loss function allows it.
Kuhn (1962) describes major changes as ‘paradigm switches’ (though someone noted that he used that word in perhaps 40 ways). I rather draw attention to the change from empirical knowledge to definition. This change need not be a paradigm switch. Paradigm switches may be the most intriguing or flashy examples of the introduction of new definitions, but the change from empirical knowledge to definition does also occur in ‘normal science’.
Holland around 1600 had the theological argument between Gomarus who defended predestination and Arminius who defended a measure of volition. This discussion had started before them, didn’t end with them, and continues till this day, also in these pages.
The 20th century gave a novel twist to the argument, namely quantum mechanics. Instead of the folly of the gods, there now is a randomizer with a scientific garb. If objects, and the molecules in our brains, have random aspects, then this would be neither determinism nor volition. Quantum mechanics normally is applied at the micro level of particles, and there is the suggestion that larger aggregations of masses still would behave in the Newton-Einstein fashion. Schrödinger however gave an example - his cat - how quantum mechanics could also extend into this macro world. So the challenge to the debate on predestination is real. [60]
The quantum model is stochastic of itself. This differs from the randomness caused by simple measurement errors - the randomness commonly used in economics. However, economics has some purely stochastic models of itself too. There is for example the Erlang queueing model. Consider a postoffice with clients arriving and being served. Interarrival and service times can be modeled with exponential distributions, and this allows us to determine the average length of the queue, the average waiting time, the average utilisation rate of the service window, and such. If the situation gets more complicated, then research economists use computer simulation models to find the best way of operation. This example shows that economics already is familiar with a model that is stochastic in itself. Note that there are some ways to re-introduce a degree of determinism - as your barbershop may require you to make an appointment. The basic observation that we make here is that the stochastic approach is basically a modeling method, and there is no implication that arrival and service are intrinsically random.
The discussion above introduces the various components, and the question now becomes what to make of it all. The following gives my solution.
First of all, science by definition avoids the ‘deus ex machina’ assumption. An understanding of reality is looked for without reference to a god. So our discussion is not burdened with the associations of eternal damnation (and predestination to this).
Secondly, science by definition aspires at a deterministic understanding. Scientists may adopt a stochastic approach with only a limited degree of accuracy, but the target remains a 100% accuracy - which is determinism. Hence, by definition, scientists have a deterministic predisposition. [61] [62]
Thirdly, the idea of a ‘free will’ is a moral category, differing from physics. Admittedly, the scientific approach would presuppose that our moral considerations depend on our brain, and the movements of electrons and molecules that could be caught in a determistic model - but the proper conclusion is that we don’t have that model yet. The existence of time, and in particular the uncertain future, is a precondition for morality. An ‘existence proof for God’ would be that in the limit of time, prediction accuracy rises to 100% and all moral beings are going to make the proper moral choices. [63] But we don’t know for sure that those choices will be really moral - and anyway it is hard to see how this could affect us. For example, we may predict, as social scientists, that when economic conditions worsen, that politicians then may be more inclined to morally dubious choices. But we need the passing of time to determine whether this prediction materialises - and, as human beings, we would still want to form a moral opinion and discuss the moral aspects. The conceptual gap between ‘ought’ and ‘is’ remains. Eventually there might be a practical (non-conceptual) bridge, but for those same practical reasons it isn’t there yet.
Though science does not refer to gods, we can use a god anyway for clarification. Janus, the Roman god and name-giver to the month of January, had two faces, one to the past and one to the future. Figure 17 uses the Janus head as an analogy to locate the various concepts.
Note: This only displays the three opposing concepts
in one picture,
without implying that all concepts to the left are equal
or that all concepts to the right are equal.
The Janus head analogy works only up to some degree. We don’t know all that happened in the past, we can use probability statements for the past too, and thus we cannot replace ‘past’ with ‘certainty’. Similarly, as said, science has a deterministic predisposition, so the future basically is predetermined from a scientific point of view. Yet the head analogy is useful, since it focusses our attention to these various subtleties.
Thus, clearly, the Arminius and Gomarus debate can be seen as non-sensical if they got the two categories of science and morality confused. Even though we can have a deterministic predisposition, we still can have moral volition (and be judged by jurors on making wrong choices). Their debate would be proper in so far as Gomarus would take predestination in a moral sense - but then the debate is not relevant for us.
Thus, clearly, quantum mechanics drops out as a fundamental category. It only remains as a research strategy in the face of apparent difficulties, but it still is on the road to 100% accuracy.
Admittedly, quantum mechanics itself seems to pose that nature would have random properties at the micro particle level. Some even argue that this would be the basic example of true probability - while all other ‘examples of probability’ (like throwing dice) are basically deterministic (and we only use probability techniques to make up for our lack of knowledge or laziness in measurement). In particular, Richard Gill, professor in mathematical statistics at Utrecht university, gives this argument at a roundtable discussion:
“We should be collectively ashamed not to know anything about quantum mechanics. I would like to see all introductory texts in probability theory going a little into the physical (quantum) theory behind the geiger counter before using some data of alpha particle counts as an illustration of the Poisson process; I would like a discussion of the Bell inequalities together with a modicum of quantum mechanical background to show how elegant probabilistic reasoning shows that the quantum world is truly random (unless you would like to go for an even more weird non-local deterministic theory).” (1997b)
Indeed, also economists are familiar with the concept of Brownian movement, or the random walk, and use this model for example in analysis of the stock markets. Or in the labour market, with labour supply LS and employment LE, unemployment is u = 1 - LE/LS: but u then basically is a probability, since the model does not provide an additional explanation why one person works and the other doesn’t.
But Gill’s argument does not convince me. The point is: you may pose that nature would be such, but you don’t know for sure. You are still using only a model. The scientific challenge remains to develop a model that increases accuracy.
Yes, there is the Heisenberg uncertainty model that if you measure position then you no longer know speed, and if you measure speed then you no longer know position: and this model nicely captures a basic notion of uncertainty. But, try for a better model then - and take some thousands years more to do so. [64] [65] [66]
As a corollary, we can take a position on path-dependency (hysteresis) and chaos.
Some authors use the word ‘chaos’ in the sense of path-dependency. For example, a small variation in first conditions (starting point, parameter) can cause a widely different result - a butterfly flapping a wing can cause a tropical storm. Since we already have the term ‘path-dependency’ for this, we better reserve ‘chaos’ for the meaning of ‘seemingly random’. A chaotic system, in this proper sense, then gives a fully deterministic description, but the outward appearance that some variables would be random. Here it is strange that people who are in favor of ‘chaotic modeling’ also use this to be against determinism.
Path-dependent and chaotic models can be useful. The orbit of Earth around the sun looks solid, but over the billion years it seems pretty random. There is Schrödinger’s cat model that shows the macro world depending upon a micro state. There are the strange models in history and biology, where for example a meteor wipes out dinosaurs. OK, all these models exist, and they can be real good descriptions of true states of nature. But all this does not disprove the definitory deterministic predisposition of science. If you would run the movie again from the start (which is currently said to be a Big Bang, but I don’t know about that), then you would get, by the models that science tries to develop, the same result. If you would argue that anything else might pop up, and your mother could be a dinosaur with a pig’s head, and if you would develop models that would show this, then you are quite in danger of being out of science. (You would drop out on this definition, but could be in on the other criteria.)
Concluding this section, we find that definitions indeed guide our understanding of nature. The definition of science itself guides our perceptions - for example when it guides us into taking quantum mechanics as a model only instead of as ‘reality itself’.
A reason to be strict about this definition of science is that people, who would argue that nature is basically random, would also tend to reject deterministic results of science. A deterministic result of science is for example (1) that divergent indexation of tax exemption and the standard of living causes a tax void, and (2) that the existence of a tax void can be used to ‘abolish taxes’ without costs. It would be a pity if this result were to be rejected because of a fundamentalist ‘random view of the world’.
Our subject is the political economy of western welfare states, and in particular employment and inflation aspects. This subject is quite complex, and we must be modest about our results. Of course we can use statistics of the national accounts, and thus indirectly we use the statistical labour of thousands of statisticians, and indirectly the results of thousands of firms and of millions of citizens that filled in their tax forms. Economic literature provides a wealth of models and interpretations of these data. In my case, I also rely on my own experience in constructing a national economic model. All this, however, does not mean that we can forget about modesty, on the contrary. Nevertheless, it is my conjecture that we can achieve a more enduring result than just awareness of complexity.
What is interesting in economic discourse is the concept of ‘stylized fact’. When an economist observes some regularity, he is rather inclined to use that term. We shall use the term more conservatively, and we are hesitant about observing regularities. But we also can fruitfully employ the term when there is a regularity indeed. In some cases, when the regularity is so strong that our loss function comes in the epsilon zone, then we even can switch to definitions.
So we adopt the methodology:
(a) state what we consider to be the stylized facts
(b) define our concepts so that the stylized facts are covered by definitions
(c) develop theorems and proofs
(d) link back to conclusions about reality.
A proposition - as a statement on reality - can be regarded as a mathematical theorem about/within a model of stylized facts. When there is a tautology, we attain truth by definition.
We here deliberately refer to Bochenski (1956, 1970:20): “The word ‘proposition’ has been variously used, (...) nowadays commonly as the objective content of a meaningful sentence”.
Some students of the History of Economic Thought will see a clear resemblance of above methodology and what Schumpeter called the “Ricardian vice”. Quoted by Tintner (1968:7):
“His interest was in the clear-cut result of direct, practical significance. In order to get this he cut this general system to pieces, bundled up as large parts as possible, and put them in cold storage - so that as many things as possible could be frozen and “given”. He then piled one simplifying assumption upon another, until, having really settled everything by these assumptions, he was left with only a few aggregative variables between which, giving these assumptions, he set up simple, one-way relations so that, in the end, the desired results emerged almost as tautologies.”
This is almost exactly what we shall do, except that we generate tautologies.
Step (d) comes closest to the Popperian falsificationist criterion. Our deductions need not be insulated against testing, even though this present book abstains from econometric testing since we are too much involved in creating our concepts and constructing consistent and useful propositions. [67] Abolishing the Tax Void is a good and cheap test anyway for the relevance of this analysis.
It is useful to keep Solow’s comment in mind:
“There is something deeply satisfying - not to say suspicious - about any proposition that seems to deduce important assertions about the real world from abstract principles.” (1976:148)
So, advisedly, the reader better checks what we are doing here, and governments should run their own regressions and models before they make policy decisions. But of course I only dare to present my results here since I am confident that they, in the hands of competent and true scientists, allow a real advancement.
In his essay “A discipline not a science” (1983:365-375), John Hicks argues that economics is too far from the accuracy reached in the material sciences, and explains that he cannot ‘altogether’ deny that he himself has converged on a ‘critical’ attitude. This attitude concentrates on the clarification of terms, i.e. their definitions, also by using quite unrealistic models. For example: “Though the concepts of economics (most of the basic concepts) are taken from business practice, it is only when they have been clarified, and criticised, by theory, that they can be made into reliable means of communication.” (p372-3).
Hicks then concludes that economics is a Discipline. His quote of Keynes (in II.7) above is taken from these pages. My position on this is twofold - the position of hard science with soft data. On one hand I embrace the critical attitude. Indeed, we should develop sound definitions, and remain critical about how these are applied in communication. That is the meaning of the Definition & Reality methodology. And it brings us far, since we can advise to abolish the Tax Void without running regressions and a computer model. On the other hand, Tinbergen’s efforts have not been in vain, and models with estimated coefficients are useful tools for policy analysis. For example, some economists may reject the existence of a Phillipscurve, and all economists should be critical about the data and the parameter values, but such a relationship remains useful in a macromodel that is used for evaluation of policy alternatives. It would be curious to accept the concept of a ‘model’ and to accept other relationships like a consumption function, and reject the use of a Phillipscurve: even though the uncertainties are quite comparable.

In other words, our method remains econometrics, even though we end here with an increased awareness of the role of definitions. We are just in the phase that running regressions is useless if the model is no good. Regressions come in only when we have a good candidate, and regressions even might benefit from some definitory relationships. We even would like to do those regressions ourselves if we had the data and the time. So, for now, let us first develop what we conjecture to be the proper model.
There is the useful distinction between the structural and reduced form:
· the structural form represents actual relations as good as possible,
· the reduced form gives the simplest representation, with the interaction minimised.
With y a vector of endogenous variables, x a vector of exogenous variables, and f and g functions, then a structural form is y = f(y, x) and a reduced form is y = g(x).
Since econometrics can only approximate reality, the true structural form can only be approximated. What we consider to be a structural form is an intersubjective consensus. We anyhow have to adopt an approximation, which means that many factors have been removed. However, for two models we can often clearly see that one is simpler than the other, and then we can usefully apply this distinction between the structural and reduced form.
The distinction between structural and reduced form also affects the structure of this book. The next chapters concern the structural form, actually starting with the textbook IS-LM model. We relax the assumption of homogeneous labour, and introduce heterogeneous labour. First we look at labour supply only. Then we look at supply and demand, and at the equilibrating dynamics, which causes the topic of the Phillipscurve. We show how the Phillipscurve and the Constant-Wage-Inflation Rate of Unemployment (CWIRU, a.k.a. NAIRU or natural rate) shift as a consequence of minimum wages or poverty. We then relate minimum wages and poverty to developments in taxation. The co-ordination failure on taxes and minimum wages not only causes the internal imbalance on the labour market, but also an external imbalance, with international trade.
The discussion of the structural form results into the need for more scientific clarity. Though much seems to depend upon empirical parameters, some aspects however are more fundamental. This leads to the discussion of the reduced form. We first develop a theorem on the influence of taxation on employment and unemployment regimes in welfare states. Since taxation depends upon social choice, we then discuss Arrow’s theorem on social choice (structural form again). We also note that there may be a confusion about inefficiency and the existence of a ‘free lunch’. Having established the possibility of rational social choice, we then develop a theorem on stagnation in the policy making process (reduced form again).
In chapter 8 we stated: “If the government on the one hand would desire to use the results of scientific advice for its budget process, and on the other hand would not opt for an Economic Supreme Court, then its definitions would be logically inconsistent, and it would thereby tend to create a cause for dishonesty and improper manoeuvreing and thereby corrupt its processes.”
We can directly apply our Definition & Reality methodology. The point is that desiring for a scientific base and not making a Court is logically inconsistent. Parliament and President may ‘define’ their ‘Council of Economic Advisers’ as ‘scientific’ but when there are little safeguards, then reality takes over, and the Council will de facto not have sufficient power to resist political meddling.
The appendices contain an example draft for a Constitutional Amendment for an Economic Supreme Court and a description, taken from the White House internet site, of the CEA. The difference should be clear.
Law-givers know: If a law does not fit logic and reality, then people will see themselves forced to ‘break’ the law. “You are damned if you do, and damned if you don’t.” People in such situations will tend to grow dishonest, since it is often easier to massage events rather then clearly state that the law is impossible and go on strike or whatever. They don’t see it as ‘dishonest’, but as ‘flexible’. And once people are on that road, they will rationalise their behaviour by thinking that this is the way that the world works, and become more willing to perform other acts of dishonesty.
Conversely, once sufficient safeguards are in place, then the Council is de facto an Economic Supreme Court (even if it does not have that name). With a properly defined scientific base for the budgetary process, economists could also more confidently predict the economy’s course, since there would be less random noise and chaos about the application of known knowledge.
We consider all Western economies, or, more properly with Japan included, the OECD area. Hence, the student of this book will expect masses of OECD data, and masses of structural models of the OECD countries, or at least a model for the whole OECD area. There is none of that. We in fact use only some example data for the small country of The Netherlands. Why is that ? And how can we possibly utter our ambitious claims ? The answer to these questions is fourfold:
· there are mathematical theorems and proofs for the reduced form of a typical welfare state
· we use some key properties that will be documented here
· this chapter on methodology explains the validity of the method
· for the data and structural models we refer to ‘existing economics’.
The approach of this book is to use logic in order to circumvent the uncertainty of parameter estimates. Though the book doesn’t give full statistics, it is conjectured that the theorems capture the stylized facts. A proposition - as a statement on reality - can be regarded as a mathematical theorem about/within a model of stylized facts. When there is a tautology, we attain truth by definition.
Our first proposition establishes conditions under which both unemployment and full employment are possible. This relates to the partial arguments of economists about the labour market. Our second proposition gives the integral argument, or general theory, how (un-) employment situations are managed. The employment regime can be chosen by conscious choice, or there is lack of knowledge. Lack of knowledge forks into two cases. With full employment, the situation is dubbed ‘chance’. With unemployment, it is called a co-ordination failure.
It is useful to state that our point of departure was not mathematical economics itself. This book has been written against the backdrop of the voluminous studies Central Planning Bureau (1992a&b) and Colignatus (1992). It is from this experience that these two propositions have been selected as being of foremost importance. We want to focus on main mechanisms that block full employment and prosperous growth in modern welfare states. It is thought that the two propositions, in a sense simple but in another sense complex, help to clarify a fruitful direction for both analysis and policy improvement.
To be sure: this approach does not imply a rejection of time series econometrics ! I am an econometrician myself. Below I will e.g. develop a definition of ‘risk’ that deals with uncertainties - and in my view the 95% confidence interval should be replaced by an interval based on a well specified loss function. So I am supportive of uncertainty approaches. However, econometric models also contain definitions and institutional equations, and it is my conjecture that these have not gotten the attention required. In particular the regime switch of 1950-1970 to 1970-2005 will be difficult to determine by time series methods. Studying marginal changes within a regime will not uncover results about the switch. It would be wrong if time series analysts would only accept time series as data, and not such regime states. The Definition & Reality methodology then can help us out. [68]
Governments that become interested in the present analysis will no doubt require that it is tested against the data of their own country. This is advisable indeed. However, the claims of this book are primarily mathematical certainties, and additional empirical data will mainly provide didactic assurance. Since country parameters are different, practical policy must rely on the structural models of course, and data will be needed for detail decisions. But at an abstract level, the developments would be similar.
Chapter 23 gives a textbook macro-economic model so that we better appreciate the point of reference of ‘existing economics’. Chapter 24 clarifies heterogeneity and nonlinear taxation. There is nothing new here yet either. The subsequent chapters then take up the same subject matter, and gradually add elements and interpretations that support the novel analysis.
Our textbook model is a very simple and unpretentious first year undergraduate model. It is not interesting for itself, but for our later discussion.
We follow Dornbusch & Fischer (1994), chapters 1 - 4. The basic macro-economic identity for annual real values is:
C + G + I + NX
YR
YD + (RTAX - TRF)
C + S + (RTAX - TRF)
|
C = consumption G = government consumption I = investment NX = exports minus imports YR = real gross domestic product |
YD = YR - RTAX + TRF = C + S TRF = government transfer payments [69] RTAX = real tax revenue DEF = G + TRF - RTAX = S - I - NX S = saving [70] |
We take G, TRF and NX as exogenous and known. We are now only interested in expectational equilibrium. Aggregate demand is YR* = C* + G + I* + NX. With the rate of interest i and the marginal tax rate r, behavioural relations are:
C* = TRF + c (YD* - TRF) + C0
I* = I0 - b i*
RTAX* = r YR*
In equilibrium C = C* gives YR* = YR - since C = C* iff YD = YD* iff I* = S* = I = S. This can be represented by the IS curve:
YR = TRF +
c (YD* - TRF) + C0 + G + I0 - b i + NX 
i = (C0 + G + I0 + NX + TRF - (1 - (1 - r) c) YR ) / b (IS)
For the money and bond market:
L + DB
WN / P
MX / P + SB
|
L = demand for real balances DB = demand for real bond holdings SB = real value of the supply of bonds |
WN = nominal financial wealth P = price level MX = money stock (M1, M2 or M3) [71] |
Liquidity demand is:
L = k (1 + h / (i - imin)) YR
Equilibrium on the money market L = MX / P gives the LM curve:
(LM)
Intersection of the IS and LM curves gives equilibrium for YR and i, and from these the other variables can be solved, in particular the price level P = MX / L[YR, i].
Note that we also use: [72]
Y = P YR
While the IS-LM model already tells us something about inflation - via the quantity of money - there is also the labour market where wages drive up costs and prices. The IS-LM sectors of the economy and the labour market are linked via Value Added Y.
For our purposes we can use a Cobb-Douglas function with employment LE and capital KE:
YR = Y0 LE a KE 1 - a
Y
P YR = W LE + i PK KE,
We assume that firms maximise profits - and since we assume constant returns to scale, there is no surplus. If firms accept wage W, then the marginal productivity of labour equals the real wage W / P, and then this determines LE which must be at most labour supply LS. Unemployment then follows as u = 1 - LE / LS. If companies also accept the rental price of capital, then the marginal productivity of capital must equal i PK / P, and this determines the employed real capital stock KE, which must be at most total stock KS.
The additional equations from these marginal conditions are (and we assume expectational equilibrium on these too):
LE =
Y / W
KE = (1 -
) Y / (i PK)
With YR, P and i given from above, there is one degree of freedom from either PK or W. It is customary to close the model with a relationship that sets the average wage W. [73]
|
YR = real income LE = employment KE = employed real capital stock KS = total real capital stock |
LS = labour supply u = rate of unemployment W = average wage WT = W LE = total wage sum |
In a full model, the price of capital must relate to investments I and to wealth WN. Also, apart from a theory on unemployment, we also need a theory on idle capital KS - KE. We could also include intermediate goods, as these appeared to have been important in the Oil Crises. These alternatives however lead too far for our purposes.
Important for our purposes however is inflation. We already indicated that the price level P is relevant for inflation. The crucial thing to note is that inflation is the relative change of the price level, so that it is a dynamic concept.
Let p be an arbitrary price.
Statics assumes a timeless dimension. With supply S[p] and demand D[p], equilibrium (in expectations) is given by S[p] = D[p] and it solves for the equilibrating price p·.
Dynamics concerns developments in time. The price movement p’ = dp/dt is related to excess demand D[p] - S[p], so that p’ = dp/dt = f[D[p] - S[p]]. The solution of this differential equation gives the movement towards equilibrium. Dynamics causes different concepts of equilibrium: depending upon the specification of variables and function, the equilibrium can be market clearing (p°) or the fulfillment of expectations (p*). Economic agents generally have different speeds of reaction when expectations are not fulfilled. When there are surprises, there can be a ‘trade-off’ between prices and quantities.
For the labour market, dynamics implies a relationship between unemployment and the change in wages. This relationship is called the (wage-) Phillipscurve. Sometimes there is an additional assumption of a strong relationship between wages and product prices, [74] and then the (price-) Phillipscurve gives the relationship between unemployment and prices.
The existence of a Phillipscurve thus follows essentially from the concept of dynamics itself. For the labour market, the price is the wage w and excess demand is represented by unemployment u (thus negative excess demand; with vacancies neglected partly because of unreliable measurement), so that w’ = f[u]. Much debate in macro-economics about whether the Phillipscurve ‘exists’ or not, could have been cut short by noting that it is a standard market adjustment equation. The true debate is about the proper form and stability of its parameters.
In the simplest model we choose inflation, [75] and have, with u = 1 - LE /LS:
dLog[P] = f[u]
and this would add another restriction that closes the model. For example:
dLog[P] = dLog[P]* - 0.1 Log[ u / u* ]
would give an expectations augmented form, and when u = u* then expectations will be fulfilled, and LE = LS (1 - u*).
It is useful to note that above model does not yet contain an explicit reaction function of the monetary authorities with regarding to inflation. Money can be fixed or chosen to grow at a predetermined rate. In practice there will be a flexible reaction, and then part of the ‘Phillipscurve regression between dLog[P] and u’ will reflect that reaction function.
The textbook relations are simple in themselves, but the interactions already can be rather complicated. Figure 18 presents some common macro-economic interactions.
Figure 18: Some macro-economic interactions

The influence of income in that figure is stated in terms of growth dLog[YR], [76] and the influence of prices is stated in terms of inflation dLog[P]. Positive transmissions are in black and explained in Table 5, negative transmissions are dashed in red and explained in Table 6.
|
Positive |
Cause |
Prime effect |
Then |
Then again |
|
YR |
growth |
increases demand |
adds to inflation |
|
|
u |
more unemployment |
less income, less tax revenue |
more expenditure on benefits |
higher deficit |
|
P |
more inflation |
the Central Bank (CB) raises interest rates to fight it |
possibly, though, inflation means more profits and a reduced demand on loans |
and thus a lower rate of interest: but then the CB will maintain the level of interest |
|
i |
higher interest rates |
the government has a higher interest bill |
higher deficit |
|
|
DEF |
a higher deficit |
more demand for loans, more supply of bonds |
thus a higher rate of interest |
|
|
DEF |
a higher deficit |
sustained expenditure |
and thus sustained growth (at least by that channel) |
|
|
Negative |
Cause |
Prime effect |
Then |
|
u |
more unemployment |
lower wage demands |
and thus less inflation |
|
P |
more inflation |
more tax revenue |
and thus a lower deficit |
|
i |
a higher rate of interest |
makes investments more costly |
and thus lower growth |
|
YR |
more growth |
more demand for labour |
lower unemployment |
Homogeneity assumes that S[p], D[p] and p are real variables, while heterogeneity assumes vectors or densities. This book takes the density approach. In fact, employment e[w] = Min[s[w], d[w]] also provides the earnings or income distribution, i.e. the function that gives the number of people earning a level of income w, for labour supply s[w] and labour demand d[w].
The proportional tax is r Y. A linear but non-proportional tax is Bentham[w, x] = r (w - x), though proportionality comes back again by assuming x = 0. A nonlinear tax adds curvature (see chapter 29), and then interacts with heterogeneous labour.
The following references put the argument into perspective.
In his presentation of the IS-LM model, John Hicks (1937) could disregard differences in labour as being of secondary complication. For our purposes, however, the case of heterogeneous labour causes a crucial difference. Policy co-ordination then involves three distributions:
1. the gross income distribution that corresponds to the productivity distribution,
2. the net income distribution aspired by the policy maker (‘society’),
3. the actual net income distribution, resulting from taxes imposed (including e.g. the social security ‘insurance’ payroll tax) and from expenditure.
There is early recognition in the literature of the need for heterogeneous labour in discussing dynamics. For example, 20 years ago, Solow (1976:152), occasionally but not consistently using the more accurate term ‘surface’:
“George Perry, who was one of the earliest quantifiers of the Phillips surface, has recently produced an alternative explanation of great interest [reference]. Perry’s basic insight is that the aggregate unemployment rate may be an ambiguous measure of pressure in the labor market when the composition of the labor force and of the group of unemployed is changing. (...) In other words, the Phillips curve would have shifted upward. (...) Perry quantifies this observation by making the plausible assumption that an unemployed body generates downward pressure on the wage level proportional to the amount of “unemployed labor” he or she represents. In turn, the amount of unemployed labor can be measured by the number of dollars of wages it represents.”
No economist working in the field and worth his salt will have neglected Solow’s paper. Issues of the substitutionability of one kind of labour for another, and of dispersion measures for the differences in responses, can found even earlier in the literature.
Van Praag & Halberstadt (1980) present a continuous productivity distribution.
Bruno & Sachs (1985) give a standard reference for stagflation. Their formal analysis uses homogeneous labour and proportional taxes, though some of their statements allow for an interpretation of heterogeneity and nonproportionality.
The need for modelling heterogeneous labour and nonproportional taxation is clearly recognized in the literature, see e.g. Beenstock et al. (1987) and Minford & Ashton (1993). Layard, Nickell & Jackman (1991), another standard, allow for heterogeneous labour, yet tend towards proportionality in taxation.
In addition, these references use dynamics but do not explicitly discuss the consequences of changes in tax parameters. Auerbach & Kotlikoff (1987) give a wealth of information on fiscal dynamics but do not specifically tackle stagflation.
Other references which put the Phillipscurve in perspective are Okun (1981), Blanchard & Fischer (1989), Friedman (1991), The Economist (1994) and Phelps (1994). Extensive theoretical and empirical work has been done by the Central Planning Bureau (1992a&b), Gelauff (1992) and Colignatus (1992b).
It is useful to recognise some current views on the labour market and the influence of taxes. This allows us to better see the impact of our new analysis.
There exists a simple popular view that makes two errors:
· it is static and not dynamic
· it assumes homogeneity and not heterogeneity.
This model is the comparative statics model with homogeneous supply and demand for labour. Borjas (1996:159), Mankiw (1998:125) and The Economist of February 26 1994 present that model. As a model it of course is consistent and it can help us to get our thoughts started, but as a representation of real markets it is erroneous.
Figure 19 gives the wage W on the vertical axis and supply and demand quantities on the horizontal axis. (Note the causal order.) It must be mentioned that marginal tax rates have played a role in the deduction of the supply and demand curves.
In this Marshallian model, the original equilibrium is attained at the intersection of the LS and LD curves, at wage W° and employment LE°. An income tax causes workers to demand a higher wage, and supply shifts up, to LS1. Premiums that raise wage costs for employers cause these employers to offer a lower direct wage, and demand shifts down, to LD1. The new equilibrium of LS1 and LD1 is LE < LE° where employers pay direct wage W1 > W° and where workers receive net W2 < W°.
For this model, with supply and demand schedules derived with marginal analysis of utility and profits, there is an important role for statutory marginal tax rates. First best here are lump sum taxes and zero marginal rates.
Figure 19: Statics
Marshallian model for the influence of the tax wedge

There are clear objections to this model:
· It is comparative statics, with homogeneous and flexible labour.
· It concerns any kind of tax, while some taxes are socially desired and generate employment. The model doesn’t distinguish between optimal and suboptimal taxes.
· Empirical research shows that labour supply elasticities are low. Elasticities are higher for partners, but that is less relevant here. People are very much in the position that they have to work for a living, and taxes generally pose no restraint on the availability for the labour market. This means that LS ~ LS1 ~ vertical. (Borjas (1996) shows this graph too.)
· The model does not really allow for unemployment. We might define U = LE° -LE, but LE° is an unobserved variable. Firms and workers react to observed variables, and in those terms there is full employment. Even if labour would be inflexible in this model, then there still would be no involuntary idleness at the net wage earned.
The use of this model thus is limited. Mankiw (1996) correctly presents the model as a ‘tax incidence’ model, and we should be hesitant of other conclusions.
The Simple View however regards this model as a real description of real labour markets, and it thus makes the category mistake of using arguments concerning the income distribution for issues of growth and employment.
The reader is advised to read again Chapter 2 of Keynes’s 1936 General Theory. The General Theory is in my perception an effort to seriously develop dynamics. Keynes’s precursors did discuss dynamic developments, but always ended up in static modelling. See also Patinkin (1976:140 footnote 4).
In the following quote, Keynes discusses a real wage reduction caused by prices. For our purposes, we might substitute a real wage reduction caused by taxes.
“To sum up: there are two objections to the second postulate of the classical theory. The first relates to the actual behaviour of labour. A fall in real wages due to a rise in prices, with money-wages unaltered, does not, as a rule, cause the supply of available labour on offer at the current wage to fall below the amount acually employed prior to the rise of prices. To suppose that it does is to suppose that all those who are now unemployed though willing to work at the current wage will withdraw the offer of their labour in the event of a small rise in the cost of living. Yet this strange supposition apparently underlies Professor Pigou’s Theory of Unemployment [voetnoot] and it is what all members of the orthodox school are tacitly assuming.” (Keynes (1936:12-13)).
Note, by the way, that the format of Figure 19 can always be used in terms of the average wage W. So the format of Figure 19 may be inviting to our intuition, in that we think that we indeed can draw a diagram like that, but we then should be aware that our true model is heterogeneous labour and not homogeneous labour.
An alternative view is more empirical, thus inherently more dynamic, and builds on Keynes’ observation. Empirical research, see e.g. Ashenfelter & Layard (1986), Theeuwes (1988), Hum & Simpson (1991) and Gelauff (1992) shows that marginal tax rates have ‘surprisingly’ low elasticities. The reason for a lesser importance of marginal rates is that labour supply is not flexible, but rather fixed. That labour supply is primairily given by demographic factors, is for example a well known assumption of practical models developed at the Dutch Central Planning Bureau. In Western economies people will have to become active on the labour market in order to earn a living, and taxes hardly form a barrier. People are still very much like Marx’s proletariat, and they have little else to fall back on but to supply their labour. There is some choice for partners and for people on benefits, but this does not have a major impact. For the majority, if anything, the average wedge is more important than the marginal one, see Den Broeder (1989). Recently Minford & Ashton (1993) see scope for a larger effect of marginal rates, but, their study is still far from explaining stagflation, partly for the reason that it is not fully dynamic.
By consequence, the major equilibrating forces exert themselves on the wage and the related employment. Here arises the dynamic situation of (wage) inflation and unemployment, and thus the issue of the Phillipscurve. Thus, conceptually, tax rates have their major impact not on labour supply but on the Phillipscurve.
The next question then is whether their effects are positive or negative. The common argument is that a higher marginal rate fuels inflation. Whether this is the case then becomes the next issue.
Before we can continue the discussion, a note on the ‘efficiency wage theory’ is required. The idea is here that, though people are forced to work to earn a living, they still can choose whether they shirk or not. They take account of a probability of getting caught and getting fired, but supervision would be expensive, and, if fired, one eventually could find another job. Unemployment then is required to discipline the workers. Borjas (1996:459) provides an introductory discussion, and the graphs are quite similar to the supply and demand schedules of old.
I tend to regard this approach as an example of academic excess. This may be an error on my side, but let us look at some of the arguments: 10% of the European labour force is unemployed, hence Europeans apparently shirk a lot ! And employers are so dumb that they cannot think of cheap ways to determine productivity, like setting standards and such. Agreed, shirking is undoubtedly a phenomenon, and eventually the superior economic model will include a subtle relationship between wage, effort and productivity to determine the last digits, but all this is less relevant for the Great Stagflation and the need for an Economic Supreme Court.
Graafland (1990) introduced another approach at the Dutch Central Planning Bureau, and he refers here to Hersoug (1984). The Phillipscurve here is derived using a model of wage bargaining between unions of employers and employees. The approach is adopted by Gelauff (1992) on the CPB model MIMIC, Gelauff & Graafland (1994). It recently is refined by Graafland and De Mooij (1998), Bovenberg, Graafland and De Mooij (1998), Jongen and Graafland (1998), Graafland & Huizinga (1999), [77] Graafland and Nibbelink (1999), Oers, De Mooij, Graafland and Boone (1999), and De Mooij (1999). In this approach, a higher statutory marginal rate actually increases employment, instead of reducing it as the Simple View and many standard Phillipscurves would hold. The mechanism is as follows:
· A higher marginal rate (under constant average) penalizes wage demands, lowers such demands, reduces (wage) inflation and thus increases employment.
· A higher average rate (under constant marginal) causes compensating wage demands at the margin, and reduces employment.
These properties actually are well known, as they are consistent with analyses concerning a Tax-based Incomes Policy (TIP). For example the Congressional Budget Office (1977:119):
“In recent years there have been proposals to use tax incentives and other schemes to encourage more moderate price behavior. (...) Rather than overriding market forces, these newer proposals attempt to take advantage of market incentives by making moderate price and wage increases a matter of self-interest for firms and employees. The best known of these proposals involves tax incentives to reward or penalize wage decisions that deviate from some established standard.”
This view however still does not take account of the dynamic marginal rate. There are also the issues of labour heterogeneity and optimal taxation that we have encountered in discussing the Simple View, but that have not had sufficient attention. These issues will be discussed below.
Given more than one view, there is scope for confusion. This has in fact occurred.
· The OECD policies referred to above, directed at lowering statutory marginal rates, have been advocated using the rhetoric of the Simple View even though economic advisers often are aware of the Complex View.
· If one would really think that high marginal rates reduce work effort and supply, then a situation of high unemployment would call for higher rates - that would reduce unemployment. Policy however has been to reduce rates.
Secondly, when these views are confronted with the effects of the policy of rate reduction, there again is ample scope for confusion.
When unemployment has been reduced, then this is being seen as corroboration of the Simple View. For example the data on the US now show the combination of a reduction of taxes on higher incomes and some reduction of unemployment, and it will now be difficult for policy makers to accept other lines of arguments. Actually, in so far as there has been some success in practice, it is because the policies have also lowered average rates. Higher budget deficits have been relied on to pay for additional benefits and average rate reductions for higher incomes. The reduction of marginal rates actually had a negative impact.
In most cases unemployment has remained high. In this case one should expect that policy makers would reconsider their views. They don’t seem to do this, and rather look at the few cases where there seems to have been success along the expected pattern.
A specific example is the Dutch 1990 tax reform (known as “Oort reform” [78]). This reform was supported by computations using the MIMIC model, see Gelauff (1992). The reform reduced both marginal rates and exemption. The reduction of statutory marginal rates reduced Phillips curve sensitivities, and induced larger wage claims and lower employment. The reform however also included a reduction of average taxes, and this caused employment to rise on balance. We may restate the situation in more mundane terms: the reduction of average taxes was sold on the political market as a reduction of marginal rates. Politicians had their eyes fixed on the reduction of marginal rates and the reduction of unemployment, and they got what they wanted to see, without realising that the mechanism in MIMIC was entirely different, and that proper exploitation of this mechanism would lead to even lower unemployment.
We will first discuss heterogeneous labour supply, and forward a hypothesis on its distribution. Note that supply is difficult to observe, since generally we only observe actual employment, which is the minimum of supply and demand. However, data on actual earnings do allow the encouraging conclusion that the earnings distribution can be approximated by a lognormal distribution. For an indication we look at Dutch data on the distribution of income in 1950 and 1988. We complete this chapter by a more thorough sets of definitions for earnings, cost and income accounting, and we construct integrals that are relevant for the minimum wage.
Let us first regard labour supply.
At a Dutch economists “Masterclass” session in Fall 1991, Orley Ashenfelter explained that labour supply was unresolved and actually some kind of a researcher’s nightmare. In a break I put my suggestion on the blackboard, and my ‘quiggly’ line (see below) at least drew the compliment of an amused smile. I almost put this suggestion into Colignatus (1994a), but backed away from that since it was not essential for that paper (and I used only the normal right hand side of the supply graph). However, to my surprise and pleasure I saw that same quiggly line in De Groot & Keuzenkamp (1995) who discuss results of Quah (1993).
De Groot & Keuzenkamp have another subject than labour supply. Their problem is whether international economic growth results into convergence, as Adam Smith’s “The Wealth of Nations” seems to imply. De Groot & Keuzenkamp refer to the results of Quah (1993) who has compiled the distribution of output per labourer per country, which turns out to be that quiggly line.
To understand the point, let me first explain my reasoning on labour supply. At low productivity, one has to work 24 hours around the clock in order to survive. For example, if subsistence is at B and productivity is y, then the hours are B / y. Hours thus quickly rise when y drops (the working poor). When productivity increases, one quickly starts working less hours, particularly since the kind of work at that level often concerns hard labour. At higher levels of productivity again, the kind of work is less exacting and pay is better, and one may work longer hours again. However, at the highest levels of productivity, labour again becomes a relative disutility. In summary, when plotted in a graph, the figure looks like a dromedary, starting high at the left, having a dip in the neck, then the bump, and sliding away towards the tail.
If labour supply is like this, then it likely affects the productivity distribution across nations. While every individual has his or her own parameters, aggregation may average things out, and as a result one nation then may stand for a certain income group. Thus Quah’s finding is consistent with my intuition and indirectly confirms it.
Figure 20 plots the quiggly line, for imaginary income y in thousands of dollars and subsequent working hours per week, for both long and short ranges of income so that the curvature can better be appreciated.
Figure 20: Supply in hours per week, depending upon income

Note: These are not observations, just give an hypothesis on shape
I’m still working on a correct form of the complete utility function. Barro & Sala-i-Martin (1995) give a recent discussion of the trade-off of work and leisure in the context of growth, and that might be a fruitful framework. However, for the present purposes, our development may stop here.
The literature on the distribution of income has resulted into a general impression that this distribution can be approximated by a lognormal distribution, see e.g. Pen & Tinbergen (1977). For the purposes of our exposition it is useful to test this impression. [79] Also, since we will discuss long periods of indexation, notably from 1950 till 2002, it is also useful to look at the distribution in 1950 and a recent one. We then take the distribution data in the appendices for Holland 1950 and 1988.
Figure 21 and Figure 22 plot the resuls of a (rough) estimation. It appears that we get the best fit when we transform the data into logarithms (and recompute the frequency densities - i.e. the transformation required to deal with different class sizes). The logarithmic data are approximately normal, as can be seen in the plot of log[income] versus its frequency density. We can transform the estimated distribution for a plot in the income-frequency format.
Figure 21: Dutch income distribution 1950

Figure 22: Dutch income distribution 1988

In the 1988 plot, the estimation has been done with the 1988 ‘parttimers’ dropped, but they are included again in the income-frequency plot so that we can better appreciate that their inclusion would confuse a discussion on fulltimers. But it is nice to see the dromedary shape returning.
We conclude that income can indeed be approximated as a lognormal distribution, and throughout time; at least as a stylized fact that we can use for propositions and illustrations. [80]
There are some useful definitions and formulas for heterogeneous labour markets. These hold for any distribution, not just the lognormal distribution. Let y and w be micro values that have a certain density. First of all, there are the following accounting definitions, for annual and nominal values:
·
=
the profit rate, expressed as a markup on labour costs
·
y = labour costs + profit = w (1 +
) = product revenue =
productivity
·
labour cost quote = LCQ = w / y = 1 / (1 +
)
· labour costs = w = (direct) wage + nonwage (but labour related) costs
· w = net labour income + (direct + indirect) taxes + premia + other nonwage costs
· tax = T[w] = (direct + indirect) taxes + premia
· gross labour income = labour costs - other nonwage costs = net labour income + tax
· Neglecting the “other nonwage costs” gives w = labour costs = gross labour income. (Thus the w are labour earnings only if the other nonwage costs are zero.)
Observed labour costs have a density fw[w].
Since the product is y = w (1 +
), equalisation of profit rates with respect to
labour would give the labour cost density fw[w] as a shift of the
productivity density fy[y]. Normally, though, the profit rates
are equalised in terms of capital, which for example causes different Labour
Cost Quotes (LCQ) per sector of industry, and then the relation between fw[w]
and fy[y] is a more complicated affair.

The proper labour supply density sp[.] depends on net labour income (w - T[w]). But supply can, with the neglect of “other wage costs”, be regarded as a function of labour cost w, as:
s[w] = sp[w - T[w]]
Labour demand is a density d[w]. Total supply follows from the integral:
& 
The employment density is the minimum of supply and demand, and equals the observed labour cost density:
e[w] = Min[s[w], d[w]] = fw[w]
For total employment we take account of a minimum wage M.
![]()
For the discussion below it is also useful to compute aggregate labour costs and its (nominal) tax revenue:
![]()
![]()
Important are the average wage W = WT / LE and the average tax rate ATXR = TAX / WT (when we can neglect other nonwage costs).

Densities for unemployment ud and vacancies vd follow from the difference between supply and demand and actual employment:
ud[w] = s[w] - e[w] & vd[w] = d[w] - e[w]
The aggregate unemployment and vacancy are U and V, and their rates are:
u = (LS - LE) / LS = u[M] & v = (LD - LE) / LS = v[M]
Figure 23 gives the stylized fact that vacancies tend to occur at higher income brackets and unemployment at lower ones. The figure is quite stylized, since it is a difficult issue to construct plausible s[w] and d[w].
Figure 23: Supply and demand of labour


If labour supply LS was homogeneous, we would have difficulty explaining that u LS would be unemployed, since these persons are similar by assumption. Basically then u is a probability.
For heterogeneous labour we could use characteristics and a mechanism that explains why some are employed and others not. This mechanism could be related to the shift of the densities over time due to aggregate demand, inflation, technology, job changes and the like. In fact, we would use such methods to determine ud[w] and vd[w] in practice - and perhaps we would not start with w as the defining characteristic, but start with other characteristics and work towards the wage. However, we will not look into this deeply. We will use heterogeneity mainly to explain the effect of the minimum wage. For a level of income above the minimum wage we again assume some probability, quite analoguous to the homogeneous case. Basically, an agent has offers for various kinds of jobs and incomes, and associated probabilities (and one for unemployment). The s[w] and d[w] thus have a stochastic base.
Minimum wage unemployment differs from the ‘normal’ unemployment above the minimum. Thus:
u = um + un

Only part of um can be gainfully employed when the minimum wage would be abolished.

Only un will exert a meaningful pressure on wages. A major dynamic process is that um rises over time, contributing to the phenomenon of hysteresis. Labour market processes and wage settlements might stay stable in terms of un, i.e. the “normal” unemployment rate, but they shift in terms of u, the overall unemployment rate.
One may wonder why M is nonzero, when its abolition would create employment ume. The apparent reason for governments is that labour markets are not fully competitive and require some regulation. This issue is taken up again in the next chapter on subsistence.

These integrals don’t say how large the densities are. An indication of how much M ‘bites’ is difficult to find. An approach is the following. Let us define ms such that (for example) 1% of supply has an earning power of less than ms. Similarly, md for demand. Then Table 7 distinguishes six situations. [81]
Table 7: Combinations of ms, md and M
|
|
ms < md |
md < ms |
|
Minimum wage irrelevant (M < md) |
M < ms < md |
M < md < ms |
|
Minimum wage irrelevant (M < md) |
ms < M < md |
|
|
See point (b) below. |
|
md < M < ms |
|
See points (a) and (b) below. |
ms < md < M |
md < ms < M |
There are some notable effects:
(a) On the supply side, if ms < M, then would-be earners of ms < w < M become eligible for benefits. When they accept these benefits voluntarily or from social pressure, they, in a sense, form no real supply. Yet they are supply, otherwise they would not be eligible for a benefit.
(b) On the demand side, if md < M, then there would be a real demand for md < w < M if government would reduce M. But this demand is not relevant when M exists.

A crucial point to see is that, as we here are concerned with productivity, that we can use subsidies to manipulate the densities, for example by subsidising a particular industry or profession. Doing this of course causes an accounting problem: does the w on the horizontal axis measure productivity before or after such subsidy ? The most practical approach is to use w inclusive of subsidies - because market measurements are always inclusive. Subsidising firms would allow them to hire at higher wages: this would shift d to the right. Subsidising workers would allow them to work for lower wages: this would shift s to the left. What happens to employment is not a priori obvious.

It turns out that the minimum wage is important in practice. Our analysis will strongly rely on minimum wage unemployment. In this we differ a bit from the original position taken by Keynes. As Tobin (1972: 122) states:
“But why is the money wage so stubborn if more labor is willingly available at the same or lower real wage ? Consider first some answers that Keynes did not give. He did not appeal to trade union monopolies or minimum wage laws. He was anxious, perhaps over-anxious, to meet his putative classical opponents on their home field, the competitive economy.”
In my view, Keynes’s argument (as further explained by Tobin) is to the point, and aggregate demand, sticky wages and the co-ordination failures on these are established concepts in macro-economics. However, the record of the Great Stagflation is very much influenced by the minimum wage problem, and thus it is that kind of analysis that merits our attention here.
With respect to the textbook macro-economic model in chapter 23, we can introduce a minimum wage component in unemployment uM that can rise gradually over the long run due to taxation. With u = uM + uR (R from ‘remainder’) a possible Phillipscurve with less dampening effect of uR is:
dLog[P] =
dLog[P]* -
Log[ (uM + uR) / u* ]
Alternatively, the two submarkets have their own curves. In both cases, it must be determined how the two submarkets develop and how they interact. The most obvious hypothesis is that high productivity labour sets the trend for the development of wages. When minimum wage unemployment rises stronger than general unemployment, then the higher educated have more scope for wage demands, and then there is an upward effect on wages and prices, even stronger so when price expectations come into play. This would show an unfavourable (upward or rightward) shift of the (aggregate) Phillipscurve.
This chapter is a bridge between the standard macro model and the elaborations on heterogeneous labour and taxes. The concept of the ‘welfare state’ depends upon our concept of subsistence and the elements that go into its index , and on the decisions that we take on this at the national level.
In Book III we already regarded some indexation of subsistence and taxes. Here we will refine indexation of net subsistence. Gross subsistence will be T -1[B] as determined by the tax system. A way to understand this chapter is that it formulates conditions for the tax system.
We already saw two possible indexation schemes for subsistence: (i) on average net income or (ii) on gross average income. The latter presumes that taxes are an indication of welfare too. This current chapter will look an another way of indexation that takes an intermediate position that might be better but that might also be needlessly complex.
We will find that if we adopt certain indexations, then we must accept some divergence in development in other terms.
Subsistence labour forms a special group within heterogenous labour. The group only exists if we acknowledge heterogeneity. In the labour supply density we already hypothised a ‘dromedary shape’ that partly reflected the fact that a minimum income means longer hours when the wage drops. Let us now discuss subsistence more extensively.
With man a social animal, sociobiological and social psychological causes apply in general. Precisely what these causes are, and how they apply, is a subject of serious study, see for example Aronson (1992a&b) and Wilson (1993). A regularity for mankind seems to be, vide these studies, that in certain cases people show a certain amount of care for their fellows.
This care should not be overrated. Part of it may not be empathy, but simply be precaution and an insurance for the event of personal misfortune. Also, some care obviously reduces the chance of a violent reaction of the disadvantaged. There are clear examples of empathy breakdown. For example, archeologists found ancient mines with such small shafts that these mines could only have been worked by children. We need not have illusions about working conditions, especially since it were lead mines. Nevertheless, whatever these clauses and contrary cases, ‘normal conditions’ seem to provoke a distinct level of care.
A strong assumption is that people have views about the whole income distribution. A simpler assumption is that people recognise a level of subsistence - which for dynamics likely implies that they adjust that subsistence to developments.
The strong assumption might well be that the income distribution is lognormal for social - and not ‘economic’ - reasons, and that the economic process only is oriented at directing people to a fitting place in that distribution. Economic productivity is essentially a nominal concept. It is not just the technical amount of goods per hour that can be produced, but also multiplied by the price of the product, and the price is determined in a social situation where status considerations apply. The assumption that economic agents have views about the income distribution actually need not be overly strong. As Tobin (1972, p122) states:
“(...) This observation led Keynes to his central explanation: Workers, individually and in groups, are more concerned with relative than absolute real wages.”
However, for our discussion, we narrow down the problem to the subsistence or the net minimum wage, and disregard views on the whole income distribution.
Suppose that a group recognises some subsistence. A group even might be defined by its shared views on this. For example, members of a royal family receive a certain allowance that meets their standard of living, and their standard of living helps to show that they are members of that royal family. The view oriented at the inner group thus is linked to the exclusion of others. Others should have less, precisely to distinguish them from the inner group. Being a royal family does not amount to much, if you don’t have subjects. This process works all the way down, so that even people in minimum conditions flatten out differences among themselves, and seem to compare themselves to beings of assumed lesser stature. (So the simpler assumption could be used to build the strong assumption.) This discussion also clarifies that the size of the group matters. There is only room for a national subsistence floor if the simpler assumption allows for a large group. So the simpler assumption properly reads that groups not only define subsistence for the inner group, which is less controversial, but also, more controversial, subsistence for society as a whole.
Note that any assumption, simple or strong, is not sufficient by itself. Society also has the coordination problem of aggregating the individual preferences on national subsistence, particularly since not everyone who wants to raise the living standard of the poor has the personal means to do so. Sometimes there are legal rules. Often labour unions come in. For example in Holland collective bargaining results into industry minimum wages that are on average at least 10% higher than the legal minimum wage. More generally, subsistence is simply a social convention. A certain level of living is regarded as inacceptable, both by most employers and by the work floor in general.
One way to implement a welfare system would be to set social security at B, and leave it at that. There would be no need for a minimum wage, since employers would have to offer at least B. In practice government nevertheless create a minimum wage system too, and allow a gap between the working wage and the benefit. One of the reasons is better control, so that agents are less likely to both receive a benefit and work on the side. One of the other causes undoubtedly derives of the social forces that call for a decent minimum. [82]
Sometimes labour market regulators may be aware of the problem of the minimum wage, and may opt for a lower indexation of M even though it results into a lower B. But the effectiveness of such policies that reduce subsistence depends upon the strength of conventions in all factories and sectors.
It is useful to note that conventions are sensitive to various considerations. For example, the Dutch legal minimum wage holds for fulltimers, but does not hold for parttimers. Holland now has a lot of parttime work. [83] It is also interesting to observe that tax exemption x is established within the bureaucratic realm where there is no direct confrontation with the standard of living. For its own historical reasons, exemption is generally indexed on inflation. These matters, while also being evidence that human care for other people should not be overrated, again clarify that our subject matter is not simple in itself. Subsistence itself is very simple, especially to those who are subject to it, but it can be made complex, especially by those who govern.
Economic theory has long been aware of notions of empathy, vide Adam Smith (1759, 1984) on moral sentiments.
Some tax theorists suggest that the social subsistence level should be exempt from taxation. Hofstra (1975) recalls the Cohen Stuart 1889 analogy, that a bridge must hold its own weight before it can be used.
In his 1980 presidential address to the American Economic Association, Solow (1980) discussed his reading of Pigou’ work, and writes:
“The last comment of Pigou’s that I want to cite is especially intriguing because it is so unlike the sort of thing that his present day successors keep saying. Already in the 1933 Theory of unemployment he wrote: “... public opinion in a modern civilized State builds up for itself a rough estimate of what constitutes a reasonable living wage. This is derived half-consciously from a knowlegde of the actual standards enjoyed by more or less ‘average’ workers ... Public opinion then enforces its view, failing success through social pressure, by the machinery of .. legislation” (p.255). A similar remark appears in Lapses [Pigou 1944 Lapses from Full Employment]. Such feelings about equity and fairness are obviously relevant to the setting of statutory minimum wages, and Pigou uses them that way.” (p5)
Solow in the next sentences also emphasises the power of social pressure, and shows himself aware that the minimum wage need not be a special application since social pressure is abundant:
“... it is even more surprising ... that employers so rarely try to elicit wage cutting on the part of their laid-off employees, even in a buyer’s market for labor. Several forces can be at work, but I think Occam’s razor and common observation both suggest that a code of good behavior enforced by social pressure is one of them.”
We already have encountered these indexes of subsistence:
· The graphs in Book III are based on indexation on the net average wage Net[W] = W - T[W]. This presentation has been chosen since its approach is more conservative.
· Another indexation is on W itself, which thus considers taxes a part of well-being. Property (13.3e) however shows this equivalent to the first, for the Bentham tax, provided that exemption is properly indexed too.
Indexation on gross income (i.e. on W) agrees better with economic intuition, since taxes need not be a real burden, when they generate goods that enter the utility function. However, some taxes can be wasteful or can be discarded for other reasons. In the following we will take a middle position, adding and substracting income elements. In particular:
· some public goods Q are provided by nature: breathing air and the berries in the field
· taxes go into public goods Gp, that subsistence workers get for free too (as licensed free riders)
· some government expenditure Gs may benefit only special interest groups (wastefully)
· some government expenditures Gn actually benefit the average tax payer, and should be considered part of ‘net income’
· some taxes go to the support of the unemployed - B U - which the unemployed cannot provide for themselves
· there is the possibility of different consumption baskets (different deflators)
· it is recognised that people at subsistence tend to have more sweat and less leisure
· tax revenue can change disproportionally with income.
Considering these element, it seems that the adoption of a detailed index would likely cause little difference with gross income indexation. Many of the additions compensate for many of the substractions. Also, if subsistence were to lag behind average income, then it might well happen that subsistence is increased at some point anyway.
It nevertheless remains useful to develop the detailed index formally. If your interest in the subject is not very strong, you are advised to skip the remainder of this chapter. The reader who studies this section will notice that we do not achieve very much. Some of the formulas look complex, but on close inspection only say the obvious.
We assume a ‘basic insurance’ setup for social security. The unemployed get a benefit of B. At higher earning levels they may have additional insurance, and be paid on top of B. But this is of no concern for our issue. Also, who is on benefit but gets a job offer, accepts this, on the penalty of losing the benefit anyhow. This means that nominal transfer payments are NTRF = B U. We also take b = NTRF / LE = B u (redefining the symbol b - no longer the IS curve). Similarly q = Q / LE.
Let g = NG / LE be average nominal government expenditure per worker, with g = gn + gp + gs. We will assume Ricardian equivalence, so that government budget deficits are regarded as part of taxes, so that there effectively is no deficit. [84] Hence TAX = NG + NTRF.
Then the average wage tax rate AWTR
TAX / WT =
(TAX/LE) / (WT/LE) = (g + b) / W.

For the special interests we distinguish two kinds of situations.
· When average income itself is the special interest, then gs can also be regarded as net income, part of gn, and then this case is equivalent to gs = 0. Note that we could include gn in Net[W] mathematically anyhow (but don’t do this for clarity).
·
Alternatively gs
0. In particular, the average income group could be a victim
of a coalition of the poor and the rich, the first getting a high B and
the second a large gs. [85]
In a democracy with voting population LS, a majority of LS/2 + 1
indeed can levy high taxes on the other LS/2 -1. In that case it would
not be fair to regard the tax on the average wage as beneficial to the common
good. (Note that this analysis for gs
0 is weak, since not all possible redistributive
schemes are considered.)
Price indices for the average and subsistence workers are P and Pb. Real positions thus are W / P and B / Pb. Government prices are Pgn, Pgp and Pgs, giving gnr and gpr and gsr. Similarly Pq and qr.
The difference in leisure and sweat will be compensated here by choosing a suitable Real Income Ratio RIR.

All together, we have:
· Net position of the average worker Net[W] + gn + gp + q
· Net position of the subsistence worker B + gp + q
·
The real income ratio RIR
(B/Pb + gpr + qr) / (Net[W]/P
+ gnr + gpr + qr)
The government would set RIR at a specific value, and then determine B from the other values:
B = Pb { RIR (Net[W] / P + gnr) - (1 - RIR) (gpr + qr) } (27.1)

One thing to show is that B has a small multiplier on itself because of b. We can use the average tax rate difference Z between national and private average:
Z
TAX / WT - T[W] / W
Z = (g + b) / W - T[W] / W
T[W] = g + b - Z W
Net[W] = W - T[W] = W + Z W - g - b = (1 + Z) W - g - b
Using this for the RIR:
· Net position of the average worker (1 + Z) W - gs - b + q
·
Then RIR
(B/Pb + gpr + qr) / ((1+Z) W/P - gsr - B/Pb u +
qr)
(27.2)
The first term of (27.2) contains a small (negative)
multiplier of B on itself. In full employment, u
0.02, and with RIR
0.30 the multiplier might
easily be neglected. That is, neglected in (27.2) but not for the determination
of the RIR in the base year - since B u cannot be neglected for the base
of the RIR. Since (27.1) and (27.2) are mathematically the same, using (27.1)
makes that the question of neglecting that small multiplier does not arise.

Another point is that the index becomes simpler if all price
indices are the same. Taking P = Pi gives RIR
(B + gp + q) / ((1+Z)
W - gs - B u + q).

Let us consider a numerical example. Suppose that gn = gs = q = 0 and that prices are equal. Suppose also that AWTR = TAX/WT = 0.30. We also take the Bentham tax T[y] = Bentham[y] = 0.5 (y - B). Let us consider the path that subsistence is half of average income, i.e. B/W = ½, and then compute the various ratios. Then:
· Indexation on gross average income gives B / W = 0.5.
· Indexation on net average income gives B / Net[W] = B / (2B - 0.5 B) = 0.66.
·
Then T[W] / W = 0.5 (W - ½ W) / W =
0.25, and Z
0.30 - 0.25 = 0.05.
· Since gn = gs = 0, g = gp, and AWTR = (gp + b) / W = gp/W + ½ u = 0.30. If we assume full employment u = 0.02, then gp/W = 0.29.
· Then RIR = (B / W + gp /W) / ((1 + Z) - 0.01) = (½ + 0.29) / 1.04 = 0.76.
Note that the ratio numbers 0.50, 0.66 and 0.76 by themselves mean little. In both cases B is set at half W, so the value of B is not affected. The only point is that the bases are different each time, and apparently smaller. These bases of course change again for other assumptions on the various variables and functions. Where there is no difference at a particular moment (base year), there however arise differences over time. The following tries to find out more about this.
One way to trace developments over time is to make
plots as
we did in Book III. Another approach is more formally, and a commonly
used
route here is the assumption of a constant macro-economic progression
factor. This factor is the elasticity of tax revenue with respect to
income (Koopmans (1975:103)), thus mepf = (Y / TAX) (
TAX /
Y). The factor is determined by tax
parameters, their indexation, the income distribution and its change. In this
case, without a deficit, the progression factor applies to expenditure too,
which may be taken to mean, effectively, that taxes are indexed such that tax
revenue follows expenditure.
We shall take the progression factor for the average wage, which is exclusive of profits and the growth of employment. Thus our
= (W / g) (
g /
W). We assume a
nominal position, thus include price developments in government expenditure
relative to the average wage. We set gn = 0 now, since it can be
included mathematically with gp. We also assume that
is equal for gs
and gp, so that gs = gs[0] W
/ W[0]
= gs0 W
and gp =
gp[0] W
/ W[0]
= gp0 W
. Thus g
= g[0] W
/ W[0]
with properly g[0] = gp[0]
+ gs[0].
Then
g
/
W
g / W =
NG / WT. This has
the specific property that
= 1 implies that the quote g / W = g[0] / W[0] is constant,
and thus NG /WT is constant too. We will use this property below.
Taking W separate:
and hence
(27.3)
Inclusion of the progression factor does not cause special observations yet. If
< 1 then in the limit of W the indexation can be rather simple,
especially if Pb qr / W goes to zero too. If
> 1, then there could be a point
where the markup on W is zero, or subsistence would have to be zero -
which would suggest an unrealistic tax function. The progression factor becomes
more useful if we regard special cases.
Definition: A (democratic) state is “Madisonian”, iff gs = 0. James Madison remarked that a proper democracy with a majority rule actually safeguards the interests of the minorities.
Definition: A “real welfare state” aspires at a constant RIR and takes q = 0. The idea on the latter is that breathing air is prerequisite to utility and no source of it. The berries in the field are owned by someone, and no longer free. (If they were free, then Coase’s Theorem shows that they could be counted as part of income, and hence they would no longer be free for all practical purposes.)
Definition: A “pragmatic” real welfare state sets u = 0 in the determination of the benefit level and RIR. The factor B u really does not amount to much.
Definition: “Uniform prices” means P = Pb = Pgs = Pgb = Pgn = Pq. If this happens then one price index P suffices.
Theorem B1: In a pragmatic Madisonian real welfare state with Ricardian equivalence and uniform prices, (i)
RIR = (B + g) / ((1 + Z) W) (base year)
and
B = W ((1 + Z) RIR - NG/WT) (henceforth)
(ii) If RIR is constant, then: (1) A constant quote for
government layouts (or progression factor
= 1) only allows for some variation in B/W by
variation in the average tax rate difference Z. (2) If Z is
constant, then B is fully indexed on W.
Proof:
(i) For the base year: substitute the results of the definitions in the RIR (vide (27.2)), note that the prices cancel and that g = gp. Then find the base year result as stated, and then use (NG /WT) W = g to get the annual expression.
(ii) For (1), we use
= 1
NG /WT = g[0] / W[0] from above. Then
simply rework the equation for a constant.
For (2), if NG/WT and Z are constant, write B
= c W. Then
B
/
W = c
= B / W. Hence
Log[B]
=
Log[W].
Q.E.D.
Theorem B2: In a pragmatic Madisonian real welfare state with Ricardian equivalence and uniform prices, net income indexation is only feasible for special tax functions.
Proof: To see what happens if B is indexed on Net[W],
write n =
Net[W] /
W.
Note that 1- n is the marginal tax rate for W, and that
B /
W =
B /
Net[W] n.
With B = W (1 + Z) RIR - g (theorem B1) use W (1 + Z) = (Net[W] + g + b) and get:
B = RIR Net[W] - (1 - RIR) g + RIR b
Note that b
0, since we have set u = 0 only in the determination of
the RIR. Then:
B /
W =
(RIR Net[W] -
(1 - RIR) g + RIR b) /
W
= RIR n -
(1 - RIR)
g / W +
RIR u
B
/
W
B /
W= (RIR n -
(1 - RIR)
g / W)
/ (1 - RIR u)
We again find a small multiplier. Dividing by n gives the transform to Net[W]:
B /
Net[W] = (RIR
- (1 - RIR)
NG /
WT / n) / (1 - RIR u)
LogB /
Log[Net[W]]
= Net[W] / B (RIR - (1 - RIR)
g / W / n) / (1 - RIR u)
Indexation on Net[W] means that the left hand side is 1, and that Net[W] / B is some constant. Setting net income ratio B / Net[W] = NIR[0]:
NIR[0] = (RIR - (1 -
RIR)
g / W / n)
/ (1 - RIR u)
We want to find the conditions under which RIR is a constant (for the ‘real welfare state’). Solving above expression for RIR gives:
![]()
A special case has
= 1 and thus NG/WT = g / W constant,
and n constant, i.e. for the Bentham tax function n = 1 - r.
This is only feasible if u is constant too. There is a more general
class when
g / W
/ n is some constant, but u must be constant here too. In other
cases the RIR is implicitly adjusted to make B / Net[W]
constant. But nonconstancy of the RIR conflicts with above definition of the
welfare state (that must have constant RIR).
Q.E.D.
This chapter deals with the confrontation of labour supply with labour demand, and the equilibrating dynamics. With high unemployment, wage growth may be reduced. With low unemployment there may be ample room for wage demands, and wage inflation can rise.
Chapter 25 already provided a background discussion on the Phillipscurve, and for example pointed to Graaflands c.s. derivation from a Nash maximising framework. In this chapter we take that possible development for granted, and concentrate on concepts: what variables are relevant for a Phillipscurve, and how do we characterise equilibrium.
It appears to be useful to first develop some concepts of dynamics.
The Phillipscurve reflects the hypothesis that (wage) inflation is influenced by unemployment. Of course other factors are important too, such as (price, wage) expectations and forward shifting of taxes. Whatever other influences, the key notion of the Phillipscurve remains the influence of the employment situation. Wage adjustment now is considered to be the dependent variable while normally the price would be the independent variable. Wage adjustment will consist of a shift along a curve and a shift of the curve, and for both we still use the term ‘Phillipscurve’.
As remarked, labour supply is relatively fixed. Utility maximisation and rational calculation will primairily be directed at finding a competitive wage (competition not necessarily meaning full competition - as we e.g. referred to a Nash equilibrium). An individual who sets his wages too high will become unemployed. Even the probability of becoming unemployed will have a sobering effect. Given this framework, the model must concern a dynamic process of unemployment (threats) and wage adjustment.
First consider a homogeneous market with price level P. Price adjustment towards the market clearing equilibrium price P° depends upon excess demand, and since excess demand is determined by the price level, we get a differential equation:
P’ = dP / dt = f[ D[P] - S[P] ] = f ° [ P° - P ]
Note that the choice of ‘excess demand’ as the explanatory variable is arbitrary. We might as well take excess supply, or allow demand and supply to react differently, or have a different sensitivity to prices and quantities. Similarly, we can also take the quantity as the explained variable. And we can also formulate the equation in expectational variables.
Some authors hold that above relationship for price dynamics is an hypothesis that needs further clarification. I think that this is too cautious. Admittedly, it might be too simple to only presume that agents know that they are involved in a market ‘tatonnement’ process, and further explanations can be helpful. Agents have various tools available, and the choice of offering and accepting prices and quantities can be described, using an optimising framework. The speed of adjustment in markets depends upon characteristics like the size of the market, the historical relationships between agents, ‘menu costs’, and the like. It is also useful to distinguish ‘normal’ periods and ‘shocks’. However, the level of detail depends upon the use of the model, and above relationship suffices our goal.

Inflation is the rate of growth of prices, i.e. p = dLog[P] / dt = P’ / P. The change in inflation is dp / dt = P”/ P - (P’)2 / P2 in terms of the original price level. Acceleration of inflation would be d2 p / dt2.
We need to clarify a term. The economic literature uses the term “Non-Accelerating-Inflation Rate of Unemployment” (NAIRU) for that rate of unemployment that causes dp / dt = 0.
This term thus should be “non-accelerating prices” or “non-changing, or constant, inflation”.
Secondly, it appears that the formulation in terms of differentials is less useful for practical economics than the formulation in differences. So we will use differences instead. Inflation then is p = (P /P[-1] - 1) (often expressed as a percentage).
Thirdly, we regard wage inflation rather than product price
inflation, thus
= (W /W[-1]
- 1). Please note that we use the different letter font
for wage inflation, since we use w for
the level variable in densities like e[w]. Properly we should
substract productivity growth, but for our purposes we may now assume that
productivity is constant. Note that wage inflation can be different from price
inflation, since productivity is determined in terms of the output price, and
output will not be only consumer goods but also exports, investments and
intermediates.
We will use the term “Constant Inflation Rate of Unemployment” (CIRU) for that rate of unemployment that causes p = p[-1]. Similarly, the Constant Wage Inflation Rate of
Unemployment (CWIRU) gives that rate of unemployment that causes
=
[-1].
[86]

We use the term “Equilibrium Rate
of Unemployment” (ERU) for that rate of unemployment that causes wages to
adjust to their equilibrating or market clearing level
° = (W° /W[-1] - 1). The CWIRU might be a special kind of ERU. The idea is that once inflation has been constant for a long while, you start expecting it. Table 8 contains an overview of the concepts.
Table 8: Concepts for wage inflation 
|
|
|
REH: white noise surprise |
Non-REH: other surprises |
|
CWIRU
|
uf = ERU[FE] |
CWIRU = ERU[REH] = ERU[FE] |
Maybe temporarily, but impossible in the long run |
|
Other |
CWIRU = ERU[REH] |
Maybe temporarily, but impossible in the long run |
|
|
Non-CWIRU
|
uf = ERU[FE] |
|
|
|
Other |
ERU[REH] |
No equilibrium in any of these senses |
Note: We use ° to indicate market clearing equilibrium, and * or E[.] for expectations
and expectational equilibrium. We use · when we allow for either.
We can recognise at least two equilibria:
·
FE: full employment, when all labour resources are used except
for friction unemployment uf = ERU[FE]. Normally
° is
a direct function of uf, for example
° = h[uf, u[-1]] +
dLog[Money]. It may be that people’s expectations on nominal wages are
not fulfulled, so that
°
E[
]
. A
FE policy is only successful if
=
° and u = uf.
·
REH: the rational expectations equilibrium, when expectations are fulfilled except for random error. Thus
* = E[
],
it so develops that
=
* +
, and this optimality is only in terms of expectations. In ERU[REH] unemployment may be far from uf = ERU[FE]. The situation can be stable if people only regard the price signals (and whatever else is in the specification), and are satisfied as long as their expectations are fulfilled.
Let the change in wage inflation be sensitive to wages with
degree
and
sensitive to quantities with a function f[u], with u the rate of unemployment. The following gives a rich (wage) Phillipscurve that contains not only the
rate of unemployment but also past and (forward looking) equilibrating wage
inflation. [87]
-
[-1]
=
(
-
[-1])
+ f[u]
(28.1)
=
+ (1 -
)
[-1] + f[u]
(28.2)
Generally for the CWIRU from (28.1):
0 =
(
-
[-1])
+ f[CWIRU]
CWIRU = f -1[ -
(
-
[-1]) ]
According to the Rational Expectations Hypothesis (REH):
* = E[
] =
.
Then from (28.2) - interpreting REH as ‘model consistency’:
* = E[
] =
* + (1 -
)
[-1] + f[E[u]]
* =
[-1] + f[E[u]] / (1 -
)
(28.3)
We can also prove that u = E[u] and then define E[u] = ERU[REH]. [88] Hence: [89]
=
[-1] + f[E[u]] / (1 -
)
E[u] = f
-1[ (1 -
)
(
-
[-1]) ] = u
In this specification, the CWIRU can be ERU[REH], and ERU[REH] can be CWIRU. Namely, when
*
=
[-1],
or when
expectational equilibrium is associated with constant wage inflation.
Some ERU[REH] however can exist with nonconstant inflation that is not
CWIRU.
Since equilibrium wage inflation
* is
determined also by other factors such as money, the ERU need not be constant.
Even when u = ERU[REH] for each separate year, then
might still have an erratic development
over the years. Similarly, the CWIRU can be an ERU[REH], but need not be. It
can even be that
= E[
] but expectations are not REH - since
the error is not white noise.
For full employment, policy is successful, if and only if u
= uf and
=
*, so that:
ERU[FE] = uf = f -1[ (1 -
) (
-
[-1]) ] (28.4)
This equation has the same format as ERU[REH]. It follows that uf can be REH, and REH could be uf. However, they need not be, since, though we have used the same symbol f, in practice there can be different functions and also additional variables depending upon the FE or REH assumption. [90]
Similarly, with this specification there might be constancy, and of course there might be not. And as said, constancy might not be the real issue, as small fluctuations in a stable range might be acceptable too. [91]
In the selection of f[u] we have to take account of the fact that u can shift as a result of the minimum wage. Workers below the minimum wage are not relevant for the labour market, and do not exert a downward pressure on wage inflation. Above we saw that u = un + um. Let fu[un] give the fundamental nonshifted relationship for that part of unemployment that still affects the development of wages. Conforming to empirical regularity:
fu[un] =
-
Log[un +
]
Here
is a parameter for horizontal adjustment,
gives the slope, and
is a constant shift
in u. Note that fu[un] may be very sensitive to low values
of un and
,
since the logarithm from 0 till 1 is very steep, and un commonly is
measured in percentages and thus covers that range. Now, for f[u],
an endogenous shift in u then can be included by:
f[u] = f[un + um] = fu[un] = fu[u - um] =
-
Log[u - um +
]
Note that f[u] here is also acceleration, since 1/(1-
)
disappears in
and
. Figure 24 gives two regimes, plotted for both the f[u] in the left part and the Phillipscurve in the right part. Parameters are
=
= 5,
= 0, and um = 0
[case (a)] respectively um = 6 [case (b)]. It is assumed that
* =
[-1]
= 2 respectively 5, so that the minimum wage unemployment of 0
associates with an equilibrium wage inflation path of 2, while the high
minimum wage unemployment of 6 associates with a high wage inflation
path of 5.
Since
* =
[-1] the CWIRU’s can be found when f[u]
= 0, and these result in values of 2.7 and 8.7 (= 2.7 + 6).
Figure 24: Dynamics: unemployment and inflation

Given the assumption of
*
=
[-1] it also follows
that the Phillipscurves are just horizontal translations of the f[u],
and one can see the values of 2, respectively 5, for the assumed wage
inflations at the CWIRU’s.
The cases (a) and (b) in Figure 24 reflect the developments in the OECD in the 1950-2005 period. Case (a) gives the situation somewhat like the 1950s. The trade-off of inflation and unemployment then took place at low rates along the long drawn line. The trade-off of wage (price) acceleration and unemployment gives the CWIRU. At that point price acceleration is zero, and inflation remains at a low and constant value. Case (b) gives the situation of stagflation, where both the CWIRU and the trade-off-process around it have worsened. The move from (a) to (b) can be called ‘stagflationary’. In the 1960s and 1970s authorities targetted for low unemployment at the cost of rising and eventually high inflation. In the 1980s and 1990s the authorities targetted against inflation and accepted high unemployment.
The short term Phillipscurve concerns the direct trade-off of unemployment and (wage) inflation and is given by the long drawn curves. This trade-off has only limited explanatory value. Nowadays unemployment is concentrated at the low income section of the income distribution, and it is not likely that this can be battled with high wage inflation. This phenomenon is rather explained by the shift of the CWIRU or the long run relationships between equilibrium unemployment and wage acceleration, which are given in the left diagram.
It is useful to note:
· The CWIRU need not be constant. It could be if e.g. the relation indeed is linear and if the coefficients are fixed. But neither need be the case. The CWIRU in all likelihood is itself a variable that traces out a path. (Which is another reason why the name ‘natural rate’ is unfortunate.)
· There is a movement of the curve and a movement along the curve.
· The movement of the curve is not determined by the labour market alone. Policy makers may neglect labour market measures, and may opt for high inflation (1970s) or for high interest rates (1980/90s) to fight minimum wage unemployment that is not affected by these.
We may recall the 1995 Nobel Prize for Robert Lucas. The Swedish Academy put the following text on the internet:
“The change in our understanding of the so-called Phillips curve is an excellent example of Lucas’s contributions. The Phillips curve displays a positive relation between inflation and employment. In the late 1960s, there was considerable empirical support for the Phillips curve; it was regarded as one of the more stable relations in economics. It was interpreted as an option for government authorities to increase employment by pursuing an expansionary policy which raises inflation. Milton Friedman and Edmund Phelps criticized this interpretation and claimed that the expectations of the general public would adjust to higher inflation and preclude a lasting increase in employment: Only the short-run Phillips curve is sloping, whereas the long-run curve is vertical. This criticism was not quite convincing, however, because Friedman and Phelps assumed adaptive expectations. Such expectations do in fact imply a permanent rise in employment if inflation is allowed to increase over time. In a study published in 1972, Lucas used the rational expectations hypothesis to provide the first theoretically satisfactory explanation for why the Phillips curve could be sloping in the short run but vertical in the long run. In other words, regardless of how it is pursued, stabilization policy cannot systematically affect long-run employment. Lucas formulated an ingenious theoretical model which generates time series such that inflation and employment indeed seem to be positively correlated. A statistician who studies these time series might easily conclude that employment could be increased by implementing an expansionary economic policy. Nevertheless, Lucas demonstrated that any endeavor, based on such policy, to exploit the Phillips curve and permanently increase employment would be futile and only give rise to higher inflation. This is because agents in the model adjust their expectations and hence price and wage formation to the new, expected policy. Experience during the 1970s and 1980s has shown that higher inflation does not appear to bring about a permanent increase in employment. This insight into the long-run effects of stabilization policy has become a commonly accepted view; it is now the foundation for monetary policy in a number of countries in their efforts to achieve and maintain a low and stable inflation rate.”
The Academy is a bit too assertive. The Phillipscurve need not be vertical in the long run. It may well be that there is no fixed solution, and that the long run gives a non-converging movement. Also Phelps (1994) has reminded us that the CWIRU (in his words the NAIRU or ‘natural rate’) need not be constant.
Secondly, there can be other causes than expectations, and these might be more important for understanding the present situation. One important cause is the mechanism of the minimum wage. Hence the models used by Lucas and his predecessors need not be the relevant models for explaining the empirical shifts in the Phillipscurves and their CWIRU’s.
If labour is heterogeneous, then utility maximisation and rational calculation are not only directed at demanding a competitive wage, but they are also directed at selecting the kind of submarket (and its associated wage). This complicates the situation. Can we say that a dentist is ‘unemployed’ in the market for farmers ? Or closer linked, that an assistant professor is ‘unemployed’ in the market for professors ? However, we may note that an individual who sets his wages too high will become unemployed in any submarket. This causes an intuition that the selection of submarkets can still be represented by wage schedules. There will be more equilibrating forces than wages only, e.g. education or migration, but it can be reasonable to concentrate on wages.
With heterogeneity, the unemployment that is relevant for a submarket will have effects on the evolution of the wage in that submarket. Aggregating, however, we get an effect of macro unemployment on the average wage. Hence above simple relationship can be retained, but its interpretation changes from homogeneity to aggregation of heterogeneous submarkets.
Above we used um to show how the Phillipscurve can shift. Note that this in fact has only been a didactic procedure. I wanted you to understand the formulas, and it appeared very instructive to draw graphs of shifting Phillipscurves. However, when there are LS homogeneous labourers, we have some difficulty explaining why (1 - u) LS could work and u LS could not, even though they essentially are the same. Hence minimum wage unemployment and the shift of the Phillipscurve due to it, properly belong to the world of heterogeneous labour.
We here can extend the list of factors that can cause a shift in the aggregate Phillipscurve:
· The match of demand and supply above the minimum wage may cause separate problems. We will discuss the issue of crowding out on the labour market below.
· Vacancies will strengthen the position of employees and their unions. Employers may nevertheless wait with filling vacancies in order to find better opportunities later.
· There is ‘forward shifting’ of the tax burden T[w] / w from employees to employers (and then into product prices).
· The Labour Cost Quotes w / y may not just affect the equilibrating wage (or expectations) but may as well cause a shift.
· Poverty - see below.
We would basically model all submarkets - with minimum wage unemployment of course only occurring at the bottom. However, let us first look at the macro level only. Let us be the summary shift variable inclusive of all factors including um. Let usr be the summary shift variable exclusive of um. Let v the rate of vacancies, TAX/WT the tax burden. Let History be the history of all variables. Then redefine f[u]:
us = us[u, v, TAX/WT, WT/Y History] = um + usr[u, v, TAX/WT, WT/Y, History]
f[u] = fu[u - us] =
-
Log[u -
us +
]
A crucial topic is crowding out on the labour market. Highly productive labour can replace lowly productive labour more easily than conversely, and this has effect on wage claims. This might be something like a continuous version of the insider-outsider theory.
Unemployment among the higher skilled is not large. The analysis here is that this is caused by crowding out on the labour market. When potentially higher productive people face the choice between unemployment and a comparatively lower paid job, they choose the latter (noteably when they are tired of waiting or when the benefit runs out). They thereby “take the places” of others - who repeat the process to others below. The initial set-back in pay level tends to translate into demand for pay rises. Who crowds out, has a stake in trying for pay rises. A lot of crowding out will cause a mood for inflation. Who have been crowded out towards unemployment, have some incentive not to inflate, but have little countervaling power against the general mood for inflation.
Figure 23 already presented the stylized fact for labour demand and supply, i.e. that vacancies tend to occur at higher income and unemployment at lower income. [92]
There is a meaningful aggregation of vacancies and unemployment by subcategory of low and high productivity workers, giving Vl, Vh, Ul and Uh. When vacancies are asymmetrically relevant only for the higher incomes (V ~ Vh, Vl ~ 0), and when there are always vacancies for higher incomes due to crowding out (Vh >> 0), then V is not that important. However, V may become important again when Vl is made nonzero by proper tax policies. If low productivity labour has a stronger position in the labour market, then the risk of unemployment is spread more evenly, and trend-setting high productivity labour will be cautious about wage claims. High values of Vl and Uh, i.e. vacancies for the low productivity group and unemployment for the highly productive group, have the largest wage checking effect. High Vl and Uh make it difficult for the trend setting higher productive workers to shift the risk of unemployment to the lesser productive workers. We will not formally develop this point.
Crowding out on the labour market typically refocusses the policy co-ordination problem to the lower end of the market. This phenomenon tends to reduce the problem and our vocabulary in these pages to social subsistence, tax exemption and (legal) minimum wage.
A crucial difference between the United States and Europe is that the US accept more poverty (e.g. by low controls on its minimum wage laws), while Europe chooses high minimum wages and benefits to raise standards of living. The shift of the Phillipscurve thus is more obvious and stronger in Europe than in the US. In the US the working poor still work, so unemployment is lower, and the shift of the Phillipscurve is less strong. Sometimes the argument stops here. It remains a topic of consideration though whether more than just this can be said about poverty.
Poverty affects productivity directly. A clear case is medical care. With less medical care, there are longer periods of illness, and more chances for complications of a less well attended illness. Employers are less likely to hire less healthy persons.
Poverty affects personal appearance. A shabbily dressed and badly groomed individual has less chance of employment than a person of average appearance.
Poverty affects social attitudes. Social seggregation and cultural differences reduce the chances of employment.
Poverty affects capacities. Rich people need not study much, need not read many papers, and may only watch soap operas. They are rich, and can enjoy themselves. But those of the rich who would like to study, read, watch serious tv programs, and drive out to educational events, have the means to do so. Those who are not that rich, and those who have to study to maintain a higher living standard, may work and still earn enough to enable them to study. Those of the poor section that might want to do the same, do not have those means.
One aspect of US poverty is crime. Poverty does not actually force people to crime, as some people demonstrate, but for many it in fact appears to be very seductive. Jacobs (1996:573), referring to Freeman (1996:25-42), explains that about 2% of US males is in prison, about the same rate as long term unemployment in Germany. Taking account of women, the overall US imprisonment rate is about 1.2%. The highest rate of European imprisonment is for the UK, with 0.3%. So for the US we might add 0.9% to the unemployment rate.
Also, additional 5% of US males is on conditional leave etcetera from the prison system. More have a criminal record. Those points reduce the chance for employment.
Some of these points, like imprisonment, work directly as a minimum wage. Some other points rather affect the employment or earnings distribution, and cause a structural rise of Ul.
Here, for simplicity, we take the wage level w instead of wage inflation. The rates of change can be found by comparing to w[-1].
Wage w, a continuous vector for each market, depends upon the power position of employers and employees, which is determined, amongst others, by the relative situation of unemployment versus vacancies. Since unemployment and vacancies have been expressed above as functions of w we solve w as a fixed point. We also add the equilibrating w* (or expectations E[w]) that are a function of product y, the tax burden for forward shifting, the labour cost quote, macro variables and the history of the variables. The submarkets Phillipscurves can include influences of other submarkets and general developments pertaining to all markets. A macro-economic hypothesis is that the development within markets is not merely influenced but even dominated by general events. The relationships are clearly dynamic, and we thus read all variables as time dependent.
w[y, T, Macro] = w[ w*[y], ud[w], vd[w], T[w] / w, w / y, Macro, History ]
Note that modern large models depend upon convergence techniques, and that the computation of fixed points can be included into convergence in general (though it would be computationally burdensome).
The stylized facts can be summarized as: [93]
· In the 1950-1970 period, welfare states generally had a high tax exemption level and full employment.
· In the 1970-2005 period welfare states generally had a low tax exemption level. To ensure a decent stardard of living, required gross income then rose and exceeded productivity in the low end of the market, generating unemployment, while shifting the Phillips curve and reducing its sensitivity.
· Even when the statutory tax system has a low exemption level, then subsidies for the lowly productive keep them in work. And subsidies can be at the firm or state level. This is crucial for the Japanese and Swedish experiences, see e.g. Aoki (1990) and Standing (1990). Note that, in a reduced form, subsidies turn up as ‘system-wide exemption’. A subsidy is no ‘real’ subsidy if it compensates for wrong taxes.
Measures to block crowding out boil down to giving the low productivity group some guarantee for work at decent income. Such guarantees can be collective/semi-private arrangements of the Swedish/Japanese type. For the more common mixed economies, the guarantee is market-conforming, and notably consists of tax exemption.
Taxes are relevant for the discussion of stagflation at least for the following reasons:
(1) Taxes divert income and thus affect aggregate demand, especially when tax revenues go to benefits and consumption instead of saving and investments.
(2) Taxes are thought to cause forward shifting, i.e. that taxes are shifted into wage costs, which then may cause inflation.
(3) Taxes reduce net wages, and might affect the supply of labour. Statutory marginal rates are thought to have disincentive effects.
(4) If exemption is lower than subsistence, then a higher minimum wage is required. Differential indexation widens the gap.
In the following we will first discuss the relation of social insurance premiums to the economic concept of a tax. Then we regard the common tax structure of OECD countries, where the structure concerns both a statute and the dynamic adjustment policy. We introduce a nonlinear tax function and rules on indexation that captures this structure. We then show the effects of differential indexation, and present our new analysis on marginal rates.
Tax dynamics can be split into two components: the dynamics of the short run - where a local temporal equilibrium is attained using the calculations on the marginals - and the dynamics of the long run - where the locus of possible equilibrium points is shifted by long run effects on the levels of the variables. Both components appear to be equally important for our understanding of the subject. The observations on the long run can be usefully discussed in conjunction with the theoretical developments.
In our discussion we will take premiums as part of taxes in so far as it is economically relevant to do so. This may need some clarification.
Premiums for old age, sickness, disability, unemployment and the like are often regarded as insurances, and studied separately. In the practical situation of empirical economies these provisions are often indeed administered by separate institutions called ‘insurance companies’. And there indeed exists the possibility to apply the mathematics and economics of insurance to these topics. However, that these provisions are called ‘insurance’ should not cause us to regard them as only such. Part of these so-called insurances are provisions for the efficiency of the labour market.
To understand this, let us take the case of a low wage labourer. Suppose that he would have to pay such an amount of premiums, for only a limited package of insurance, that his net wage would make him eligible for benefits, or his gross wage would make him unemployed so that he also gets a benefit. Once he relies on benefits, the mentioned insurances are provided for him for free.
This thus shows the structural identity of the problem of exemption in ‘insurance’ with the problem of exemption in taxation. Hence, on economic grounds, insurances here are lumped together with taxes, in so far as they are provisions for the well functioning of the labour market.
Note too that governments would be wise to follow a ‘basic insurance policy’ which holds that workers can be insured up to a basic level but without payment of premiums. This reminds of the ‘basic income argument’, but only applies to the mentioned premiums. Similarly poor people exempt from taxation receive public goods, without paying for them.
Most developed nations have nonproportional taxes, i.e. tax codes with an exemption at the threshold and then a (rising) statutory marginal rate. The latter parameters in fact concern the intercept and the slope of the tax function. There is also a remarkable similarity in the policy regarding these two parameters (or sets of parameters), see OECD (1986):
·
The policy feature concerning the intercept or exemption.
Exemption generally is low, also with respect to social insurance.
Tax parameters, and notably exemption, are generally indexed on
inflation. Since incomes tend to grow faster than inflation, exemption
lags behind incomes.
There is a deliberate tax creep - measured by the ‘macroeconomic
progression
factor’.
·
The policy feature concerning the slope or the statutory
marginal rate.
Both in theory and public discussion there is a consideration that high
marginal rates have disincentive effects. This has resulted in the policy
objective to reduce marginal rates. One way to reduce marginal rates has been
the switch from income tax to VAT.
Given the common notion of budget neutrality, these two features in policy tend to complement each other. Budget neutrality requires that the revenue loss due to slope reduction is compensated for by other proceeds. These other proceeds will often come from the tax creep and the reduction of exemption. At least, it is often thought that the reduction of exemption generates additional revenue. This, however, turns out to be a wrong assumption.
Book III introduced the Bentham tax function Bentham[y] = r (y - x) with exemption x and marginal rate r. This function is linear but already results into nonproportional taxes. Governments in practice have nonlinear tax schemes that give stronger nonproportionality, reflecting political views on the redistribution of income.
Strong nonproportionality has a special effect. Since taxes in the 1960s were more nonproportional than nowadays, the tax structure combined with the lognormal shape of the employment function, and generated strong nonlinear effects and a strong upswing of the CWIRU in the early phase of stagflation.
It is useful to introduce a flexible tax function with one more parameter than Bentham’s function to incorporate some curvature. This new function allows us to give concrete examples whenever nonlinearity is useful. For clarity, it appears that this function can approximate the actual Dutch tax situation. The tax function is:
(y
> x)
with y the tax base and x the exemption or threshold, r the marginal rate in the limit when y goes to infinity, and c a curvature parameter. The ordered set of parameters is q = (r, x, c). [94] We do not use Greek symbols for these parameters since we will regard them as key strategic variables. If governments would use this function for practical tax collection, they might note (1) that exemption would be determined by subsistence, (2) that r would follow from the limit marginal rate for the highest incomes, (3) so that curvature c would follow from required total revenue and the income distribution. Use of this function thus both allows for a decent degree of nonproportionality and would reduce much of political debate about positioning of tax brackets and rates.
A person’s average tax is:
![]()
The marginal rate on the marginal dollar can be approximated as T[y + $1] - T[y] so that the common tax payer will have no problem in determining it. The proper formula itself is not too simple. At y = x it starts with the value r x / (c + x) and in the limit it equals r. For the whole range:
(29.1)
Note that the tax function can be transformed into a linear format consisting of income, average tax and a constant:
Tax[y] = r.y - r.x - c.Tax[y] / y = a1.y + a2 + a3.ATR[y]
Colignatus (1992) used this relation for a simple linear least square estimation that neglects the error on the average on the right hand side, using 1988 Dutch data for 12 selected income levels. The result was:
(in 1988 $)
The equation can be plotted for two ranges, (H1) for a low income range till $25 thousand to show the curvature, and (H2) for a wider income range till $250 thousand to show the straightness in the limit. In a plot, the 45-degree line is usefully added to allow visualisation of net income. Since the Dutch estimate has a high marginal rate in the limit of 57.2 %, we add US-alike lines (U1) and (U2) with a r = 40 % limit. The two ranges are plotted in Figure 25.
Figure 25: Different tax regimes 1988 ($1000)
(H) Holland, (U) US-alike

The nonproportional tax clearly becomes important when incomes differ, i.e. labour is heterogeneous in terms of productivity, labour costs and income. Lower income earners are affected disproportionally by the exemption level, not merely in terms of the income distribution but also in terms of their competitive position versus higher earners.
In Book III, equation (13.1a) already shows how the minimum wage consists of two elements. For above tax function:
![]()
Analytically solving for the minimum wage gives, due to the nonlinear curvature, two solutions for M[B, r, x, c]:
![]()
![]()
Note that the denominators are positive, so that the first
solution is more adequate. If exemption is taken at x = B, then
these two solutions degenerate into M
B and M
- c / (1 - r).
Figure 9 and Figure 8 in Book III plot the tax situation and the effect of M and B for curvature c = 0 (in the considered range), and for Holland 2002.
We already mentioned the OECD (1986) report that taxes generally are indexed on inflation. This indexation though is not consistent over time. The Economist (1991:45-46) reported:
“the most intriguing proposal now doing the rounds in Congress (...) is to increase the personal tax exemption (the amount by which taxable income is reduced for each person in a household). In 1948 the exemption was set at $600 a person; in 1990 it was $2050. According to recent evidence before the House of Representatives select committee on children and the family, had the exemption been indexed from 1948 it would now be worth $7800.”
The Dutch data had already been given in Table 4. Indexation on inflation need not be optimal. We already looked at indexation of subsistence, and it might be wise to index taxes on the same base as gross income, as suggested by property (13.3e) and the discussion on subsistence in chapter 27.
Statutory taxes generally take account of the household situation. Sometimes tax terminologies suggest an individual treatment. Regard for example the Dutch tax code. This states that partners can ‘transfer their exemption’ to the money earning partner. You may check that Table 4 on the Dutch situation indeed shows an exemption for partners, in the 1997 column, that is double the exemption for singles. The situation in 2002 is a bit more complex due to an EITC.
Note, though, that the Dutch minimum wage roughly is set at the income level for partners. Singles have less net income since their exemption is lower, but they are not allowed to work at a lower gross minimum wage that might be feasible, with the same net income by assigning them the same exemption as for couples. The Dutch concoction of ‘exemption transfer’ in fact is extremely silly. It is even more surprising that it has been introduced while all Dutch tax specialists kept a straight face. [95] The concoction also complicates the Dutch policy debate, since a proposal to raise exemption to subsistence now associates, in Dutch minds, with exemption for couples of double subsistence (which is exorbitant).
The best tax format would start with exemption at subsistence for singles.
Secondly, for partners with a single earner, a measure of ‘individual taxation’ can be introduced in the following manner. The basic ideas are:
· Home maintenance produces a product, this product is real income, and income should be taxed. However, part of home maintenance also can be part of subsistence.
·
We may allow for a degree of spillover
of income from one partner to the other.
This is the public good argument, i.e. that more people can benefit while the
cost is constant.
· Not all interaction is just spillover. Part of the interaction concerns an economic transaction. While the single person has to work for his home maintenance, he also buys it from himself. The single earner out partner buys it from the home partner. Revenue from this transaction should be taxable, i.e. on the side of the person that receives the payment.
Let yh stand for the income of the home partner, and yo for the income of the out partner. Let us use the Bentham tax, and apply it individually. Assign virtual income H to parttime home maintenance activities - and we are ignorant about the required hours. Let parttime virtual home maintenance income be part of exemption x = B’ = B + H, with B money subsistence or the net minimum wage on the market. The situation is neutral for a single person, who’s exemption is x = (B + H) while his income is y + H. The couple however is treated as follows:
·
The out partner earns on the market y, buys Ho from
the home partner, and has spillover
yh of the income of the home partner. Buying something
does not add to income however. Income thus is yo = (y +
yh), and the tax
thus is found to be r (y +
yh - B - H)
·
The home partner has own virtual income Hh, earns income Ho
from the out partner, and has spillover
yo of the income of the out partner. Income
thus is yh = (Hh + Ho +
yo) = (2H +
yo) since Ho = Hh = H (we
used the indices only for the origins). The tax thus is r (2 H +
yo - B - H) = r (H
+
yo - B)
·
Combined income thus is yo + yh = (y +
yh) + (2H
+
yo)
which consists of earned income, home production and spillover
(yh + yo)
The equations solve as:
![]()
In the special case that the tax authority thinks that spillover is zero, then the out partner gets a tax rebate of rH in comparison with the single person. The home partner would not have to pay taxes when H would be less than B (half a day home maintenance work would be less than a day at a minimum wage). In this case the couple has more net income than the single person, and the products of another persons work, though on a pro-person base they would have less. Conversely, if home maintenance is a highly priced good, then there could be a case to levy taxes.
If spillover is a nonzero constant, then there is an income
level y where the taxable income of the home partner H +
yo - B will become
positive. A person will have to pay taxes ‘just because’ he or she forms a
couple with a high income earner. If spillover is nonzero but variable, then
the value of
that
makes taxable income of the home partner exactly zero follows from H +
yo - B = 0, and
appears to be a function of income y:

If B = 2H (i.e. home maintenance gets the minimum
wage), then for y = B,
= 1/3. This means that the partner remains exempt from taxes as long as
spillover is limited to a third of income. Interestingly, at that point also
the taxable income of the out partner is yo = (B - H) /
= 3 H so that he
does not pay taxes either (since x = B + H = 3H here).
Above relationships show that individual taxation is possible that takes into account household spillover effects. For us the issue is primarily interesting for complications about subsistence. We find that there are no great complications, and we thus will further neglect the issue of partners.
With subsistence indexed on income and taxes indexed on inflation, there is differential indexation, and due to the tax structure there is a multiplier increase in the minimum wage. Required gross minimum M shows a relative rise compared to other incomes, and it rises faster than both net minimum B and the general level of income Y/LE. In Figure 10 (in Book III), when we subtract the inflation component from x, B and M, then differential indexation shows up as: x stays fixed, B moves with the income density, M moves to the right, and M, as the intersection of the subsistence and tax lines, moves up more speedily. If productivity in the lower earnings scales doesn’t rise faster than general productivity or income, then ever more people grow unemployed.
For all clarity we shall prove this. This chapter uses the specific tax function (chapter 39 will give a proof independent of form). First we will show that M grows faster than B, and then we will show that M grows faster than productivity too, causing unemployment.
Let us first derive the real subsistence index rsi again, but now for the nonlinear tax. Recall the definitions of Book III. Let B = rsi P B[0] with B[0] subsistence in the base year. Let exemption x be adjusted for inflation with index P, then x = P x[0], with x[0] the exemption in the base year that now may differ from subsistence in the base year B[0]. Let also c be indexed on inflation as c = P c[0]. Let the average wage index be W = P rwi W[0], with W[0] the average wage in the base year. Let h = x[0] / W[0] and f = c[0] / W[0].
![]()
rsi = Net[W] / Net[W[0]] / P =

which for f = 0 reduces to the Bentham-rsi deduced in Book III. For the limit, in general, we find:

which is normally below 1. Denote the denominator as F, and note that W[0] F = Net[W[0]] or F = 1 - ATR[W[0]].
We use these properties for the following theorem.
Theorem T.1: With Tax[y, q], minimum wage setting M = B + Tax[M], and balanced growth, then: if B is indexed on the net average wage and x and c on inflation only, then M rises faster than other wages, and unemployment rises.
Note: That M rises faster than other wages is not inconsistent with balanced growth. For M is only the selection of one of the proper wages that is taken to be the minimum wage.
Proof:
For all clarity, parameter r will not be indexed. Let the price level index again be P. Again W = P rwi W[0]. With real wage index rwi, the nominal index is wi = P rwi. For heterogeneous wages with wage density, we have w = wi w[0] along the balanced growth path.
For a dynamic path we have starting position B[0] giving M[0]. In the base year the minimum level is taxed at an average rate less than r, implying that B[0] > (1 - r) M[0].
We also use J as the index for the real minimum wage:
M = P J M[0] i.e. J = M / (P M[0])
(1) We first prove that J > rsi in the limit. There are two relations for B, with rsi given by the relation above:
B = P rsi[rwi] B[0]
B = M - Tax[M, (r, P x[0], P c[0])]
= M {1 - r (M - P x[0]) / (M + P c[0])}
These equations define J as an implicit function of rsi. We also see that P falls away in the right hand side:
B = P rsi B[0] = M {1 - r (M - P x[0]) / (M + P c[0]) }
rsi B[0] = J M[0] {1 - r (M[0] - x[0] / J) / ( M[0] + c[0] / J) }
As rsi and J go to infinity, then rsi B[0] ~ J M[0] (1 - r). We had B[0] > (1 - r) M[0]. Thus J > rsi.
(2) We secondly prove that J > rwi in the limit. With limit ratio R:

using the fact that the denominator equals F defined above. We want to prove that R > 1. Note, then, that M[0] < W[0], and that, due to the progressive character of the tax, the ratio of net income to total income must be higher at subsistence than at the average level, so that:
R = B[0] / M[0] / (Net[W[0]] / W[0]) > 1
(3) Thirdly, we look at productivity and employment. For this theorem, the worst case to start from is full employment. When we start with full employment at M[0], then M[0] provides the equilibrium of supply and demand. Let the supply price (or gross income or productivity) at the minimum be ms[0] and let the demand price (labour costs) at the minimum be md[0]. [96] Then in the assumed start situation of full employment M[0] = ms[0] = md[0]. Assuming balanced growth for demand and supply gives the development of the labour market situation at the bottom:
w = P rwi w[0] in general, i.e. for all w
md = P rwi md[0] & ms
= P rwi ms[0]
This means that the supplied (inherent) productivity of those at the (original) minimum grows as fast as the labour costs which employers could afford. However, the true supply price is not productivity but the (actual) minimum wage M that grows with P J and thus faster than the md. People in the class [ms, M) will not find jobs paying the social minimum. They become unemployed.
Q.E.D.
Above theorem and proof may be regarded as a bit simple. However, they help to highlight some useful aspects:
· Differential indexation can have surprising consequences compared to conventional ideas.
· Instead of thinking that productivity growth reduces employment for the lowly productive, we grow aware that it is likelier that technology creates so many job possibilities that employers can finance even higher costs than subsistence. But the multiplier effect from wrongly indexing taxes can be even faster.
· There is the combination of nonlinear tax and lognormal productivity, which causes an upswing of the CWIRU in the early phase of stagflation.
· This holds for a wide class of tax functions, even some very nonlinear ones.
· Where the term ‘income tax’ is used, it also applies to VAT and insurance for old age, disability and the like, as long as part of these are considered to be part of subsistence and thus should be included in exemption.
· This theorem and proof are for a structural form, and inspire the theorem and proof for the reduced form that we discuss later.
Our analysis points to the suggestion of ‘waiving taxes for the lowly productive’, which can be translated as ‘raising exemption’. Interestingly, this latter translation appears to provoke some terminological confusions.
The notion of ‘raising exemption’ is often taken to imply that all other brackets shift along with exemption. This causes a huge loss of tax revenue. E.g. Gelauff (1992), who uses the official general equilibrium model of the Central Planning Bureau to compute the economic impact of raising exemption, adopts this expensive approach. (His scenario also includes the Dutch concoction of the ‘transfer of exemption’ by partners, so that his implementation is even more expensive.)
However, there are some alternative implementations. Their common feature is that taxes above the current minimum wage are essentially unchanged.
The issue can be clarified by the following two graphs. In Figure 26, the function with an exemption (bold line) can be compared to a function without an exemption (thin line) but with a tax credit (bold line again). The tax credit is given as c = r1 x where r1 is the rate of the first bracket (taking that as defined by the tax credit). The two systems are mathematically identical, when seen as a vertical translation while keeping the bracket positions fixed.
Figure 26. Piecewise linear tax function with more brackets
A dubious and horizontal transformation is given in Figure 27, where the assumption of ‘fixed bracket lengths’ has been assumed rather than ‘fixed bracket positions’. When we now substract a fixed sum from the line through the origin, the original function cannot be retrieved, and the higher incomes pay more tax. It now seems as if the tax credit is ‘fairer’. However, the true cause is that taxes have been raised by shifting the bracket positions.
Figure 27. Horizontal translation
The Dutch Government “Tax Plan for the 21st Century” used this misleading horizontal translation to argue that tax credits would be more just than plain old exemption. See Colignatus & Hulst (2003:32) for the misleading statements.
Useful approaches are:
1. Introduce a new separate ‘tax group’ that only holds for workers below the current minimum wage. Let this group have a high exemption at the new minimum wage and a normal marginal rate of 50%. Clearly, there could be jump in taxes at the current minimum wage. However, the high exemption can be said to apply to all citizens - and many simply don’t qualify since they do not fall in the new group. (The latter is only unfortunate for them, if they prefer a high exemption above their current high income.)
2. One might opt for a 100% marginal rate from subsistence (the new minimum wage) up to the current minimum wage. In this case there is no tax jump. High exemption again applies to all citizens, but its effect is undone by an intermediate high marginal rate region. Whether this is considered to be a bad situation, depends upon the analysis of marginal tax rates: see below.
3. Introduce a nonlinear trajectory from subsistence to some place in the current regime. Since reduction of wage costs generates employment, the state saves on benefit payments, and some revenue can be used to reduce taxes also above the current minimum wage. This reduction can be done in a nonlinear way that allows for a fluent change, without jumps and without new tax groups. Figure 28 gives an example of such nonlinear trajectory, where the function Tax[.] has been estimated to fit the 1997 Dutch tax code (inclusive of premiums) but with a nonlinear repair towards subsistence. The special point is that this estimated Tax[.] has a negative curvature parameter. The 1988 income distribution has been used to approximate tax revenues. The currency here still is Dutch guilders.
Figure 28: Nonlinear repair Holland 1997 (Dutch guilders)

4. Figure 29 uses euro’s and the new Dutch tax code and minimum wage of 2002. Using a 75% first bracket allows the minimum wage to shift from M1 to M2. The shaded area gives the tax revenue lost, which would be compensated by saved benefits.
Figure 29: Linear repair Holland 2002

We will discuss the optimal regime later, and return to the issue of raising exemption. This paragraph here was useful to clarify some terminological confusions. It also indicates that marginal rates will feature strongly in the discussion about the repair. A marginal rate of 100% or the marginal rates associated with negative curvature seem prohibitive for practical implementation. At least, in the conventional wisdom.
A common topic in the subject of taxation is the concept of a negative income tax (NIT). A person below a certain threshold receives money instead of paying it. The negative income tax can be presented as a ‘basic benefit’: all members of society receive allowance A from the state, and pay taxes only on their additional income. The negative income tax or basic benefit is often presented as a solution to the current unemployment problem. The Central Planning Bureau (1992a&b) in fact shows that this can work.
It is useful to clarify the following. We can distinguish three groups with different effects:
· for the currently employed the NIT has no effect, since they already are employed and in fact already earn their own basic benefit
· for the people in the Tax Void, the NIT effectively only means the increase of exemption, and thus one might as well increase exemption
· for workers with sub-subsistence productivity, the NIT indeed provides additional revenue.
The second effect cannot properly be regarded as a positive effect of a NIT. Only the last effect is the NIT proper. However, proponents of the NIT often include the second group when they claim good results. In the current situation of mass unemployment, the employment effect will also be largest for the second group, so the effects of the NIT are grossly overstated. You may be familiar with the joke of the mouse and the elephant walking on a bridge, and the mouse proclaiming: “We make quite a lot of noise together, don’t we ?”
It must be noted that proposals on the NIT generally state huge sums of money. The NIT is very ‘expensive’ since all spouses would apply, causing the need for more changes in the tax code. [97]
The NIT complexities, and huge sums, also obscure the fact that abolishing the Tax Void would be for free. Proponents of the NIT thus can be compared to people at Amsterdam Schiphol airport wanting to go to Washington, and waiting at the ticket booth till they have enough money to buy the expensive ticket, while they overlook that, due to circumstances, the plane to New York flies for free.
The concept of a NIT, intended to do good, generally seems to cause people to do a lot of harm. The Central Planning Bureau (1992a&b) study assumed the gradual introduction of a NIT in the course of 25 years, keeping subsistence fixed at a constant inflation adjusted value of 1990, and the NIT fully introduced at that value in 2015. This scenario thus has the drawbacks of (a) achieving full employment only in 2015, (b) not indexing subsistence to general welfare.
It may well be that the Ministry of Finance is less equipped to deal with employment policy including the measurement of potential productivity. It would be better to quickly abolish the Tax Void, index subsistence properly, and restore the normal processes of social security and workfare to assist the sub-subsistence group.
The following equations clarify the relation between the NIT, exemption and subsistence. With market income y, the Bentham tax function Bentham[y], allowance A from the state, then net income and implied tax are:
net[y] = y - Bentham[y] + A = y - r (y - x) + A
implied tax[y]
= y - net[y] = r (y - x) - A = r (y - (x + A/r)) = r (y - x
)
So by taking x
= (x + A/r) the allowance
in fact means adjustment of exemption, with the subtle difference that x
now just stands
for the intersection with the horizontal axis, and not with exemption proper.
Normally A would be chosen such that net income at subsistence y =
B equals B, so that we might as well raise exemption to
subsistence:
B = B - r(B - x)
+ A
A = r (B - x)
x
=
B
The economic literature shows a conceptual problem, or paradox, on marginal rates. Statutory marginal rates are important in popular understanding, but not in the empirical data. Research, as witnessed by the existing literature such as Gelauff (1992), deals better with the data, but doesn’t convince the popular view. The following analysis suggests a solution.
Conventional theory, public discussion and empirical
research generally use statutory rates as the “marginals”. With T[y] the tax associated with income y, the marginal rate commonly is computed as
T[y]/
y. For our function
this is the partial derivative as used in equation (29.1). However, the tax
function is better understood not as T[y] but as the multivariate
T[y, q] with q the (now arbitrary) tax parameters. Agents
will tend to take account of parameter changes. So optimisation remains our
paradigm - and it results into marginal rates - but the better marginal rate is
the total derivative, [98]
or dynamic marginal rate (DMR):
dT[y, q]
T[y, q]
T[y,
q]
----------- =
------------ +
------------ dq / dy
dy
y
q
The topic of discussion is dq / dy. To proceed from this point, it appears didactically useful to first restate the conventional reaction to the DMR, and then develop the new analysis.
The conventional reaction is that tax parameters may be indexed to national income, but are not indexed to personal income. The individual agent in the economy will not think that his change in income can affect national tax parameters. Hence dq / dy should be zero.
Let us use the Bentham tax function again. Let us assume
that only exemption is indexed on national income, and in continuous form the
indexation reads as x =
Y with
as a fixed value for a base year. Thus:
T[y] = Bentham[y,
Y] = r (y -
Y)
It appears that
is very small. For example, with LE the number of
tax payers, and Y / LE average income, we may take exemption as a third of average income, so that
= x / Y = 1 / (3 LE). But the small size does not
invalidate the indexation method, since:
dLog[x]
= dLog[
Y ]
= dLog[Y]
Note that Y is the sum of all incomes. An income change for an individual does not affect the income changes of others. Assuming that other incomes stay fixed, we find for an individual income dY / dy = 1. If y rises and no other income rises, then the growth of national income dLog[Y] is equal to the growth for the single person weighted by its share in total income:
dLog[Y] = (y / Y) dLog[y]
It follows that the marginal tax for the individual is:
d T[y] / dy = r (1 -
)
Now, since
is such a small number, the marginal rate is virtually equal to r.

In general we find:
dq / dy = (dq / dY) . ( dY / dy) = dq / dY
Since dY / dy = 1. If parameters are indexed on national income, then dLog[q] = dLog[Y] and then dq / dY = q / Y so that
dq / dy = q / Y
which is close to zero since parameters q are generally much smaller than national income. We conclude that dq / dy = dq / dY is not quite zero, but practically zero, and this seems to corroborate the conventional reaction to the DMR.
Hence the conventional reaction to the DMR is that the DMR does not change the traditional analysis on marginal rates. Hence there is no hope for unemploment along these lines. With ongoing technological growth and competition of low wage countries, only the flexibility of labour markets will help to reduce unemployment, even if this means a reduction of net minimum wages. That, at least, is the conventional reaction.
However, Keynes (1936) explained that proper dynamic analysis inherently means that we have to consider expectations.
In this case the agent will be aware that parameters are indexed in some manner. Due to indexation, the term dq / dy can take significant values. Let q be indexed on national income growth Y. For many tax functions the indexation of parameters may take the form dLog[q] = dLog[Y] - as can be done for exemption and curvature of Tax[y]. If dLog[q] = dLog[Y] then
![]()
This again may reduce to the q / Y above. However, if we take expectations of the growth of national income, which means that the agent assumes that the other incomes do not remain constant, then:
![]()
Thus, next to knowledge about indexation, the agent will have expectations about the national income growth dLog[Y], and compare his own growth of income dLog[y] to this expectation. In terms of expectations, dq /dy does not vanish to zero. This is especially relevant when the parameter q gives exemption x that is a sizeable part of income.
So there is hope for the unemployed.
Above can also be formulated in discrete form. Indexation generally takes place with a lag, and then the discrete DMR is more adequate. This is:
DMR[y] = (T[y, q] - T[y-1 , q-1 ]) / (y
- y-1 ) =
T /
y
Book III gives a development for the Bentham tax function, and also gives plots for regular numerical values. It appears that indexation and expectations about the growth of national income (relevant for indexation) again lead to other results than the conventional view on marginal rates.
There is one area where the DMR cannot easily be overlooked. This is the area of policy simulation, where tax adjustment cannot be neglected. For sure, empirical analyses and government projections indeed deal with tax parameter changes. For example the well-known Reagan tax cuts were put into the forecasts at that time. However, we should wonder now whether the methods have been right. The analysis above focusses our attention on the impact on individual behaviour, where we regard the marginal calculation by agents themselves.
Let us regard policy simulations using common practical economic models. Let us for example regard the effects of a rise of government investments as financed by taxes, for a sustained period of 8 years (two presidential terms). To do a simulation properly, the tax function used must reflect government policy, which includes indexation. For example, exemption and other brackets are adjusted for last year inflation while the statutory marginal rates remain the same. The different investment paths result in different paths for the taxes. This is not just a model result, but also the agents in the economy would encouter different regimes. Thus the model generates different dynamic marginal rates, while the agents are assumed to react only to the same (static) rates. The situation gets even complexer when the alternative policy includes a different indexation scheme, such as indexation of taxes on national income. All this means, then, that we are justified in doubting the validity of current modeling practices. Modelers should start wondering about this kind of dynamic consistency (not to be confused with the ‘dynamic consistency of policy’ as another topic in economic literature on ‘credibility’).
It might even be, then, that the best way to understand the dynamic marginal rate is to see it as a solution to this kind of dynamic inconsistency.
Under balanced growth, taxes will grow as fast as incomes, with a constant tax share TAX / Y, assuming proper indexation of the tax parameters. A result will be that the dynamic marginal equals the average tax rate, for all individuals. Book III already mentioned the key relationship here, in property (13.3e).
We use Tax[.] for an illustration. Here a solution for a balanced growth path is that parameters x and c are indexed on y. With the index for y as i = P ryi ( i > 0), we find for the (individual) average tax burden that the index drops from both numerator and denominator:
T[ i y; r, i x, i c] / (i y) = r (i y - i x) / (i c + i y) = T[y; r, x, c] / y
(Less relevant, (29.1) remains the same too.)
The situation of a constant dynamic marginal rate is depicted in Figure 30.
Figure 30: A balanced growth shift
A-2A: constant frequency, A-C: the same average tax

Let us take the example of a doubling of income. Point A is an arbitrary point on the employment density. We scale the density so that A also lies on the tax function (H). For that arbitrary income at A we determine the average tax as a ray through A and the origin. Now, if all incomes double, then the employment frequency density shifts, and A becomes 2A. If tax parameters x and c double too, then the tax function becomes (2H). At 2A the individual pays tax C, which is the same average tax as in A (vide the straight line through origin, A and C).
Income growth means a shift of the employment density or the earnings distribution. Earlier we looked at income distributions for Holland 1950 and 1988, and the reader may now better understand why. The Dutch distributions could be approximated by lognormal distributions, but the mean, variance and the size of the labour force changed. Taxes also have been indexed on inflation instead of income. So we may surmise that there was no balanced growth.
How do agents react when there is no balanced growth ? Indexation to national income can be said to be “neutral to the income change”. The tax choices facing an individual, whose income grows as national income, are constant. The utility reaction thus depends on the change of income itself. It may be that an individual, whose income might grow as fast as national income, decides to grow differently, either more or less, depending upon his leisure-income utility. Since the context is that all individuals are adjusting, this may be reformulated as that individuals are determining their place within the income distribution.
Our analysis thus suggests that tax incentives primarily affect decisions about one’s place in the income density. Any individual change that differs from the national average can be interpreted, or defined, as the individual decision to accept another place in the income distribution. It would be interesting to reinterprete economic models on growth in these terms, and see whether elegant regularities can be found or constructed. However, it leads too far to really look into this matter, since it is not our proper subject.
We conclude that indexation and expectations about the growth of national income (relevant for indexation) lead to other results than the conventional view on marginal rates.
The tax wedge at the minimum is caused by differential indexation, and makes for a higher gross minimum wage. This has been clarified above. A second point is curvature. Due to curvature, the wedge comes close to its limit value for already low levels of productivity growth. Thus, the negative effects of the wedge occur primarily at the onset of economic growth, and are less noticeable when stagnation has already set in. This already has been indicated above, but the argument can be developed by giving formulas and plots. Especially, it are the plots that may help us to understand that the major distortionary effects took place in the 1960s and 1970s. People looking only at the events in the 1990s are less likely to see the root of the problem.
In the following we first derive the formulas and then give plots for the average tax rate (ATR) and the gross-to-net ratio (GNR). The latter ratio may better express the effect on the gross minimum wage. We find that the ATR and the GNR at the minimum rise faster than for other incomes, since the minimum itself moves faster than those other incomes. For ease of exposition we use the Bentham tax.
The average tax rate (ATR) and the gross to net ratio (GNR) are:
ATR[y] = Bentham[y] / y = r (1 - x / y)
GNR[y] = y / (y - Bentham[y]) = y / ( (1 - r) y + r x) = 1/ (1 - r + r x/y)
Examples work best. Let subsistence B be exempt from taxation so that x = B, and let the marginal tax rate be 50%. The average tax rate (ATR) of a subsistence worker then is 0, and the gross to net ratio (GNR) is 1. At twice subsistence, the tax is 50% (2 B - B ) = B / 2, and thus the average tax is 25% and the gross to net ratio of 4/3. In the limit, i.e. when exemption has been reduced to a negliglible proportion, then the average tax equals the marginal rate of 50% while the gross-to-net ratio is 2.
Next, notice two points. First, the formulas by themselves do not quite show how quickly the limit values are approached. To answer this question we can best look at some graphs. Secondly, these examples are static, i.e. at one point in time for different incomes. Thus, when we make graphs, then we can use a static index, and compare an income level 1 to an income ten times as large. In dynamics, i.e. when incomes rise, things are a bit complicated.
In dynamics, and concerning the current practice of adjusting exemption for inflation, we can take exemption as constant, and look at real incomes (adjusted for inflation). It seems as if we can take the formulas and graphs of the statics case, and compare real incomes regardless of the time. However, in dynamics, ‘minimum income’ is not just ‘income’ but is a mechanism. The concept of M is that it picks out one income as the minimum, but it can pick that income at a different rate of growth depending upon the mechanism. The interaction between indexation, net subsistence, the tax parameters cause a multiplier effect. Before we make plots we have to develop on this.
Let us first regard a general formula for dynamics, and see that it seems as if there were no difference with the formula for the statics case. Let exemption x be adjusted for inflation with index P, then x = P x[0]. Here we assume that x[0] can differ from subsistence in the base year B[0]. Let y be adjusted for the real level of income, with index rwi, too; then y = P rwi y[0]. Define f = x[0] / y[0]. Then:
ATR[y] = r (1 - x / y) = r (1 - x[0] / (y[0] rwi)) = r (1 - f / rwi) = ATRwi[f, rwi]
It must be noted that y[0] depends upon y, so that f may take continuous values. ATRwi[f, rwi] expresses that if we have a value of y, then we could interprete this as deriving from various combinations of f and rwi as long as rwi x[0] / f = y. The dynamic ATRwi[f, rwi] thus seems no different from the static ATR[y]. The complication however comes from subsistence. We cannot regard M as a normal case of y = P rwi y[0].
Denote the average tax at the minimum wage as, ATR M [rwi]. We will use the suffix ‘M’ in general to signify this dynamic point of view. [99]
In Book III we derived the real subsistence index rsi for the Bentham function when x = P x[0], so that B = rsi P B[0].
(13.3d)
Then:
M
= B + Bentham[M]
M = (B - r x) / (1
- r)
M = (P rsi B[0] - r P x[0]) / (1 - r)
m = M / P = (rsi B[0] - r x[0]) / (1 - r) = m[rsi]
ATR M [rwi]= ATR[m[rsi[rwi]]]
We can develop this a bit further, using j = x[0] / B[0]:
GNR M [rwi]= M / B = (1 - r x[0] / B[0] / rsi) / (1 - r) = (1 - r j / rsi) / (1 - r)
ATR M [rwi]= Bentham[M] / M = 1 - 1 / GNR M [M] = r (1 - j / rsi) / (1 - r j / rsi )
Over time, rsi will rise to infinity, and limit
values will be GNR[
]
= 1 / (1 - r) and ATR[
] = r as for all incomes.
First we plot the static ATR and GNR for values of a real net wage index from 1 till 10. Figure 31 plots the paths for various marginal tax rates: 10%, 20%, ..., and even 70%, all assuming x = B = 1. These plots show the point made earlier, that the ATR is close to the marginal rate at already low income values, e.g. 2 or 3 times subsistence.
Figure 31: Average tax, in statics,
for various marginal tax rates

We might interprete static Figure 31 in a dynamic way. Take B[0] = x[0] = 1, j = 1. We may take a theoretical example. If you have a period of 35 years, then a real growth of 2% per annum would suffice to double incomes. So in the standard unrefined analysis, the tax creep in 35 years would cause incomes to be taxed at average rates close to the marginal rate. [100]
The more refined analysis for the minimum wage takes account of the multiplier effect. First of all, if real subsistence doubles from B[0] = 1 to B[35] = 2 B[0], the gross minimum wage would be M = (2 - ½) / ½ = 3, and hence we should look in Figure 31 at index 3 instead of index 2. This issue however is a bit more complex, since when rwi = 2, rsi is not 2 but 1.7.
In Figure 32 we compare the standard ATR and the dynamic ATRM. We regard only one marginal rate (a 50% rate) and a ‘peg average’ W[0] = 2 B[0] or h = 0.5. It appears that the dynamic ATRM is steeper and higher than the static ATR. However, the difference is not that big. Note though that we would want an average tax rate of 0 for the minimum wage (subsistence) instead of something close to 30%.
Figure 32: Average tax rate,
static and dynamic, for r = 50%


In Figure 33 we regard the dynamic GNRM ’s, now plotted for various values of r. We can see that the rise is largest in the lower reaches of the graph. For example the 50% rate already reaches the level 1.6 around the index value of 4, and 1.6 does not differ much from the limit value of 2.
Figure 33: Gross-to-net ratio, in dynamics,
for various marginal tax rates

Some sectors of the economy are exposed to foreign competition and some are sheltered from it. These exposed and sheltered sectors are likely to have a different composition of their labour force, notably different rates of dependency on the minimum wage. If a national incomes policy does not respect these differences, a country can have both unemployment and a surplus on the trade account.
The two Oil Crises in the 1970s created a problem for the Dutch economy which has become known in the literature as the so-called “Dutch Disease”. When the price of a nationally produced but internationally traded resource rises - and this happened since Holland is rich in natural gas and a free rider of OPEC - then this causes the exchange rate to rise, and then this indirectly causes a reduction of the other exports and an increase in competing imports. Thus the original increase in national wealth paradoxically combines with an increase in unemployment - and eventually a lower growth path.
This chapter concerns the Dutch policy reaction to that Dutch Disease. If policy is not targetted at stabilisation of the exchange rate by monetary means and capital flows, but at tinkering with the labour market, then the situation - the disease - can grow worse.
Our analysis will use the distinction between the ‘exposed’ and the ‘sheltered’ sectors of the economy - a distinction that originates from Swedish analysis in the 1950s (Meidner c.s.).
The Dutch policy reaction - though with some lag - was a general restraint of wage growth. This reaction was motivated by reference to the so-called Vintaf model developed by Den Hartog and Tjan at the Central Planning Bureau - see Driehuis & Van der Zwan eds. (1978) and Driehuis, Fase & Den Hartog eds. (1988). [101] The direct assumption was that high wage costs cause the scrap of old vintages of the capital stock, resulting in an irreversible loss of capacity. The indirect presumption was that a relative reduction of production costs could compensate for the rise in the exchange rate, restoring competitiveness and employment. [102]
However, in a quite brilliant exposition that up to now has been neglected to the shame of the Dutch economics profession, Marein van Schaaijk (1983) of the same Bureau showed that a general wage restraint neglects the fact that the exposed and sheltered sectors have a different composition of their labour force, with important effects. He noted that the exposed sector is industrial and has the larger share of well educated, highly productive or high value added labour; while the sheltered sector concerns services and has the larger share of lowly educated, lowly productive or low value added labour. A uniform wage restraint - targetted at reducing unemployment rather than balance on the external account - is too high for the exposed sector and thus subsidises exports; and the restraint is too low for the sheltered sector and thus generates unemployment. The restraint of incomes also means a restraint of imports, aggravating the situation. So Van Schaaijk noted in fact both the internal and the external imbalance, recognised that these mirrored each other, and that these were prolongued, now not by the original energy price hike but instead by policy.
Indeed, Holland since then has a strong external position - exporting unemployment to Europe - and a high internal unemployment - where the unemployment is hidden in ‘disability’ (and hence registered by dull statisticians as ‘low participation’). Some surplus of the external account is reasonable given the natural resource, and the capital flows for foreign investments are useful for when the resource is depleted. But the Dutch external surplus is excessive.
Van Schaaijk’s suggested remedy was standard and sound. It was and is to let wages develop in line with productivity. Since Dutch policy is oriented to maintaining a more equal distribution of income - which explains part of the policy drive to see a uniform development in wages - Van Schaaijk advised to use tax policy to correct the differential development of gross wages for its effect on net incomes.
However, as said, Van Schaaijk’s analysis has been neglected to this day, and Holland now suffers from a long period of unemployment and a trade surplus and a general restraint of wages and net incomes. There is a curious ‘consistency’ in the delusion with policy makers, that incomes restraint is required to maintain employment by generating a trade surplus, since, by restraining the home market, most Dutch employment growth seems dependent upon trade indeed. Strangely, economic developments caused the Central Planning Bureau to drop the Den Hartog & Tjan model in the mid 1980s, but the policy of wage restraint remained.
In the 1982-1991 period I worked at the Central Planning Bureau too, and had the opportunity to get acquinted - albeit around 1986 only - with Van Schaaijk’s analysis. Apart from being enlightening by it itself, it opened my eyes - even while it was standard - to the importance of tax policy for unemployment, and thereby led to my papers (Colignatus (1989-1996)) and this present book, on the solution to the current mass unemployment in the OECD countries in general.
In my papers I have always referred to Van Schaaijk’s 1983 article whenever it was proper. However, in this chapter I have occasion to more specifically combine his analysis with my own. This chapter improves on Colignatus (1996g), and as I wrote there: this combination of our analyses has been in my mind for a long time, but there was no time to develop it, as, in fact, this chapter suffers from some time constraints too.
We shall use a general equilibrium model where the exposed and sheltered sectors have different combinations of labour as in the Van Schaaijk observation. But now we take my analysis on the minimum wage, and let the minimum wage have the differential impact. This is more relevant for the OECD in general. Note, though, that I do not want to imply that all OECD countries have a trade surplus; other conditions are relevant here too, of course.
Due to lack of time we use a closed model. Thus we cannot reproduce the external imbalance. But we can reproduce the difference in reactions of the two sectors. We may study situations with full employment (1950-1970) and without this (1970-2005). Below, we give a model, tables and graphs.
Regard a general equilibrium model with 15 units of highly productive labour (h), 75 units of modally productive labour (m) and 10 units of lowly productive, minimum wage workers and possible benefit recipients (l). The economy has exposed and sheltered sectors that produce output yE and yS, while a social welfare function (SWF) determines the optimal combination. In an open model, the yE would be traded for yForeign, but here we assume that exports are directly equal to imports for consumption. The SWF will here be a Constant Elasticity of Subsitution (CES) function that neglects the distribution of income:
![]()
Output of the sectors is determined by production functions that depend upon the allocation of the labour factors h, m & l. Since we will compare two regimes, one with l and one without l, this factor cannot be complementary (necessary), and hence it is substitutable to some degree with the other factors. The sheltered sector is a one level CES with all factors substitutable:
![]()
The exposed sector is a two-level CES where highly and lowly productive labour are complementary, but both are substitutable with minimum wage labour:
![]()
The coefficients have been chosen so that these outcomes resemble a real economy. We should refrain from making our conclusions too specific though, since the coefficients are arbitrary.
We consider two regimes, one With l (i.e. the minimum wage M is not binding), and one Without l (with M binding, causing unemployment and lower national income). Subsequently, the model is run with the computer program listed in the appendix; see chapter 37 for another application of the computer routine (and additional explanations of terms).
Figure 34 plots the production possibility curves and the SWF indifference maps of the two situations. The regime with a binding minimum wage - and less workers - indeed has lower production and lower utility. The drop in production in the sheltered sector is larger than in the exposed sector.
Figure 34: Production Possibility Curves & Indifference Maps

Figure 35 plots the Edgeworth-Bowley diagram for factors h and m, with Sheltered in the lower left and Exposed in the upper right. The movement is upwards along the contract curve. The highly productive workers in the second regime become relatively scarce, and command a relatively higher share of national income. [103]
Figure 35: Edgeworth-Bowley Diagram

The following tables give the numerical outcomes of the two regimes. When M is binding, the subsistence workers l are unemployed and dependent on a benefit. Since they do not work, output and social welfare are lower. Though there is no explicit social security in this model, we however can presume that part of earnings of the workers is channeled to the unemployed, leaving consumption from those earnings unaffected.
The social optimum is found as in Table 9. The associated allocations are in Table 10 - left and right side. When you compare the two regimes, please note that the prices are normalised per regime to a unit price for the sheltered sector, and thus are not comparable over regimes.
Table 9: Utility, production and national income for two regimes
|
|
Utility level |
National income |
Product prices |
Production |
||
|
With l |
21.20 |
39.67 |
1 |
0.9579 |
24.93 |
15.38 |
|
Without l |
18.16 |
32.37 |
1 |
0.840 |
20.74 |
13.85 |
Note: All prices are scaled so that the
product price of the sheltered sector = 1.
This is also done per regime, so
that the price levels over the regimes are not comparable.
In Table 10 we see that the share of the highly productive in national income rises. Most of the share of the l go to the m, but this is generally viewed as an internal redistribution, and most attention goes to the share of ‘the rich’.
|
|
Allocation with l |
Allocation without l |
|||
|
|
High |
Middle |
Subsistence |
High |
Middle |
|
Labour units Sheltered |
6.53 |
53.08 |
9.57 |
7.07 |
54.73 |
|
Labour units Exposed |
8.47 |
21.91 |
0.43 |
7.93 |
20.27 |
|
Labour units Total |
15 |
75 |
10 |
15 |
75 |
|
Wage |
0.88 |
0.33 |
0.19 |
0.74 |
0.28 |
|
National Income Share |
0.33 |
0.62 |
0.05 |
0.34 |
0.66 |
By proper choice of functions and parameters we have succeeded in reproducing and hence illustrating the Van Schaaijk observation & analysis of the differential reaction of the exposed and sheltered sectors on incomes policy. As Van Schaaijk found, the sheltered sector loses most, and it would be optimal to have wages reflect productivity. And similarly, this can be supported by tax policy. Whereas Van Schaaijk commented on the Dutch policy of the uniform containment of wage growth, we have concentrated on the minimum wage - as is more applicable for the OECD. Indeed, if the whole of the OECD would try to copy the ‘Dutch model’, then this would amount to trying to export unemployment to each other, and a thing like that surely would not work.
In chapter 25, the ‘more sophisticated view’ section, we mentioned that Graafland (1990b) elaborated on Hersoug (1984), and recently again in Graafland & Huizinga (1999). The approach here is a Nash solution to wage bargaining. The approach causes that marginal tax rates penalize wage demands and increase employment - contrary to the common thought that statutory marginal tax rates reduce incentives and hence reduce employment.
We ourselves forwarded the novel insight of the ‘dynamic marginal tax rate’: saying that marginal tax rates should be better measured by also including expectations on parameter changes and economic growth.
The question now arises how these two approaches combine. The Nash approach uses partial derivatives, while the dynamic approach uses total derivatives. If we would take the total derivative of the Nash solution, it might well be that statutory marginal tax rates show an effect again that is more in line with the conventional view. The four possible combination cases are shown in Table 11.
Table 11: Two marginal approaches for two Phillipscurves
|
|
Phillipscurves |
|
|
Marginal approaches |
Traditional: only labour supply |
Nash bargaining |
|
Standard marginal analysis |
(1) the marginal tax rate has a disincentive on labour supply and thus causes wages to rise |
(2) the marginal tax rate has a disincentive on wage claims |
|
Dynamic marginal tax rate |
(3) the marginal tax rate has no disincentive, relevant is the average tax |
(4) ? |
I have not performed the analysis yet. By the next edition of this book I should have. My intuition however suggests - and I keep an eye on reality - that the two approaches only combine into a stronger argument against the conventional view. Doing this additional work thus currently is expected to be a bit overdone just now.
The following has been in my mind since Colignatus (1989) but was not stated in the first edition of this book. One of the key points of Keynes in the General Theory was that the true, real, savings of an economy consist of what is invested. All the money that people save does not count as an investment or real saving. Whatever amount they bring to the banks or even hide under their beds, it is only money. One can have nominal saving S and price level P, but the division S / P is more psychological than real. What counts are the houses built, bridges constructed, lessons learnt, all that can be carried over to the next period. In fact, a company that produces but can’t sell and goes bankrupt might actually do society a favour, since at least some goods have been produced which otherwise might not have come into existence. The challenge is to get production and investment without such perceived incompetence or fraud. The economy should be designed so that those investments come about in an optimal way, where the optimum must be defined not only in terms of expectations and stability but also in terms of social welfare and full employment.
Governments, especially European ones, have been experimenting since World War II with all kinds of methods to control investments, but have been confronted with two major outcomes: (a) unemployment remained high, (b) many investments were considered failures. The economic paradigm since the Reagan years has been to let investments be determined by the market. Also Dutch social democrats like Wim Kok supported this approach, since it was thought that employment depended upon growth while growth depended upon the best investments that the market could provide. This paradigm led to reduced government outlays, less fiddling in the market, privatisation, and reduced taxes for the wealthy who were assumed to do the investing. The 1990s showed the boom associated with silicon valley - though should properly be associated also with this policy and the implementation of new financial instruments. But the boom went bust and the world was reminded of the logic of Keynes’s depression economics, see Krugman (1999).
The point of criticism is that employment and growth are rather separate issues. Our own analysis in this book shows that a return to full employment is possible. The main instrument is to get rid of the tax void. Employment does not depend upon growth per se but employment depends upon a properly working system to allocate the work that is being done in an economy. Growth comes only into the story when we aspire at higher welfare by means of higher productivity. If we don’t want growth, we can easily imagine a stagnant economy. That said, most economies aspire at a growth in welfare. We can do this by designing new products or by material investments or by creative ways to reorganise production. [104] Then the problem returns of optimising investments that define real savings. Since some sections of the economy are devoted to investments, there is also the Keynesian phenomenon that investments influence activity, income and nominal savings.
The paradigm to ‘minimize’ the role of government in investment was misguided since the relation between growth and employment was misspecified. Now that we know that the tax void was the main cause of stagflation we can reconsider the paradigm. The argument that remains is that government meddling supposedly caused failed investments. The answer to that argument is (i) that failures must be judged on a case-by-case manner, by Cost Benefit Analysis, and (ii) that one should include the concept of Keynesian recession and that some investments might seem a failure but actually are beneficial. Note that there is no need for a government deficit since the analysis on the dynamic marginal rate shows that progressive taxes need not be a drawback for the richer. If growth is the issue, then the true issue is its optimality in terms of level and composition and effects.
The line of thought that I would suggest is that this optimum requires competing investment banks that develop plans during the economic upswing that can be implemented during the economic downswing. Who worries about pensions and the EU Lissabon Strategy is advised to consider this approach. Since the market is an anonymous beast that may or may not generate such competition, it remains the challenge for governments to mastermind and manage it all.
Kenneth Arrow (1950, 1951, 1963) presented an Impossibility Theorem in which he showed that decisions about ‘the general welfare’ are impossible in certain cases or have to be left to a dictator. Arrow presented some five axioms that each seemed reasonable when considered by itself, and he argued as well that these axioms are morally desirable and fitting to the concept of ‘general welfare’. He also formulated the problem in general terms so that it concerns choices on goods or people. Subsequently, he derived a contradiction. This result caused quite some consternation, but eventually the mathematical rigour caused acceptance, and since then the Theorem forms the core of many books, such as Sen (1970) and Mueller(1989). The Theorem was also one of the reasons to award Arrow the Nobel Prize in economics.
A voting example is given by the US Presidential election of 2000. Apart from the problems around the ballot process itself, there was a more basic problem: with main contenders Bush, Gore and Nader, Bush got elected, but in another system, such as a run-off between the two ‘major’ contenders, the Nader vote apparently would have switched largely to Gore, making him the US President. So the choice depends as much upon the system chosen as on the preferences. Can we find a generally good system ? Arrow’s Theorem suggests ‘No’.
Arrow’s Theorem has had a huge influence on scientific and political thought. Part of this influence is subtle, where skepsis arises about the concept of ‘democracy’. That shiny goal loses its appeal when we don’t know how representatives should be elected and when morally desirable rules would be impossible. Opting for the natural forces in the social process may be more pragmatic. The influence of the Theorem can sometimes be more explicit. Next to the model of the utility maximising individual, there is the model for society as a whole and then the maximisation of a Social Welfare Function (SWF). But when a morally acceptable SWF is impossible, what would be the use of research into such an inherently flawed concept ? Many nations co-ordinate their economic policy, and have created institutions for this, like the Council of Economic Advisors (US), the Commissariat du Plan (France), the Sachverständigenrat (Germany), and the Central Planning Bureau (Holland). Such an institution, given its role in the co-ordination of economic policy, could be expected to do reseach on the national SWF. However, those institutions tend to abstain from that kind of research, pointing to Arrow’s Theorem as one of the arguments, if not the major argument.
Over the years an ‘accepted view’ has grown in economics concerning the meaning of Arrow’s Theorem. This accepted view however has also implied a kind of moral stagnation.
There are two main reasons to reconsider the accepted wisdom on the meaning of the Theorem and to rekindle the debate on it. The first reason is destructive, since it rejects Arrow’s position; the second reason is constructive, since it provides an alternative.
These reasons are: (1) There is a distinction between the mathematical framework on one hand and its interpretation on the other hand. The Theorem holds, and the impossibility holds for Arrow’s axioms, but the questions of reasonableness and moral desirability are of a different kind. (2) The area of application of Arrow’s axioms seems rather static, while reality is dynamic. By considering the role of time, there is more scope for morality, and then one can identify a voting procedure that many would find attractive.
The two following chapters develop these arguments subsequently. Readers interested in more details are referred to Colignatus (2001), “Voting Theory for Democracy”. That book develops the theory of direct single seat elections from the bottom up while it also provides programs (in Mathematica) to eliminate the tedious work of the calculations of the various voting procedures.
Arrow’s Theorem holds that no constitution can satisfy certain properties. In annex to that theorem, Arrow claims that those properties are reasonable and morally desirable. In Arrow’s view there thus is the difficulty that people desire a constitution that cannot exist. While the Theorem stands as a mathematical result, the additional claims concern some other matters, namely the domains of reasonableness and morality. It are these claims that have caused much confusion in the literature. It is shown here that the claims are unwarranted, since inconsistent properties are neither reasonable nor morally desirable. It is shown too that Arrow’s axiom of Pairwise Decision Making (formerly known as the Independence of Irrelevant Alternatives) is not realistic, and thus unattractive. We show the existence of some constitutions without that axiom that are consistent and might be optimal to many. The major error made by Arrow and his students is to mix up the context of scientific discovery and learning with the context of application to the real world by educated people.
Arrow (1950, 1951, 1963) showed that if certain properties are postulated for a constitution, then such a constitution would not exist. This result has been checked by numerous scholars, is accepted by this author, and thus stands as a mathematical theorem. In fact, we will give a short proof below.
Arrow also claimed, annex to the theorem, and this will be at issue here, that those properties would be reasonable and morally desirable. He recently repeated that claim in the Palgrave (1988:125). He writes:
“(...) conditions to be imposed on constitutions (...)”
“(...) there is no social choice mechanism which satisfies a number of reasonable conditions”.
For clarity it is useful to introduce the following abbreviations for the theorem and its companion claims, and their conjunction:
AT = the Arrow Theorem
ARC = the Arrow Reasonableness Claim = the properties are reasonable
AMC = the Arrow Moral Claim = that they are to be imposed
AGV = the Arrow General View = AT & ARC & AMC
Note that Arrow’s phrasing on ARC and AMC is a bit ambiguous. The “to be imposed” might not be moral but merely logical, in a sense that one needs at least some conditions to make a constitution. However, the topic of collective choice is distinctly a moral one. Secondly, Arrow emphasises what is to be imposed and what is reasonable, but he may not be in a position to impose his views and morals on us. The best interpretation of the situation likely is as follows. Presume that Arrow sees the Founding Fathers at work. He then retreats to his office, and conjectures: ‘If I interprete correctly what they want, then it are these properties.’ Thus the ARC and AMC are not quite Arrow’s personal ideas. Above quotes can best be interpreted as factual statements on what people apparently want and consider reasonable.
Arrow’s general view has been accepted in many places in the literature and textbooks, see Luce & Raiffa (1957), Johansen (1969), Sen (1986) or various other entries in that same Palgrave. For example, Tobin (1990):
“We know there is no way to aggregate individual preferences into social rankings (...). As if this were not obvious, Kenneth Arrow proved it rigorously years ago. The impossibility applies to aggregations across contemporaneous cohorts, a fortiori across generations living and unborn.”
In a much used book on Cost-Benefit Analysis (CBA), A.K. Dasgupta & D.W. Pearce (1980):
“(...) no escape route (...) seems yet to be available.”
Apparently feeling that Arrow's argument destroys the foundations of CBA, they find themselves forced, rather grudgingly, to reduce CBA to something like information gathering.
In an otherwise recommendable volume of Statistical Science, Gill & Gainous (2002) find:
“In fact, he proved that unless one is willing to violate one of a set of reasonable democratic norms, (…inconsisteny...) is an inevitability. (…) Therefore, collective social decisions cannot yield a truly democratic system in this sense.”
Jorgenson (1990), once president of the Econometric Society, concludes ‘more positively’ to dictatorship:
“The classic result of social choice theory is Arrow’s (...) impossibility theorem, which states that ordinal noncomparability of individual welfare orderings implies that a consistent social ordering must be dictatorial, corresponding to the preferences of a single individual.”
Not everybody falls for dictatorship. The impact of the AGV generally comes from the fact that people find themselves, either from moral obligation or from reasonableness, wanting the impossible. And many simply stay in that fixture.
Note the subtlety in that fixture. The impossibility is logical and not just empirical. An example may help. Let me confide that I want to found a new university on the island of Crete. However, I am not that rich, so I want something impossible. This however does not put me into a fixture, since I am used to the fact that I cannot afford some things that I want. However, the Arrow general view concerns a logical impossibility, which is something quite different.
We can usefully recognise:
reasonable = rational & realistic
Reasonableness is the intersection of rationality and empirical realism. Nonexistence may derive from empirical circumstances or from logical impossibility. Irrationality however is always unrealistic. Inconsistency cannot exist, in the true empirical sense. For example a round square cannot exist. The nonexistence of the Arrowian constitution similarly derives not from empirical reality but from logical necessity.
Given the AGV, the question arises what the reasonableness and moral presumptions of Arrow’s claims actually are. Are these claims as strong as conjectured ?
My position is as follows:
1. As has been said on ‘round tables’, it is not rational to postulate inconsistent properties. People involved in a learning process may indeed make inconsistent assumptions. However, once the inconsistency is discovered, it is no longer considered to be rational to adopt those assumptions. People may enjoy ‘roundness’ and ‘squareness’, but having both simultaneously is seen to be inconsistent, even inconceivable, and hence unreasonable. The Arrowian properties are unreasonable in the exactly same manner. Arrow’s pitfall is to confuse the learning process, his context of discovery, with real world applications by educated people.
2. Similarly, one cannot be morally obligated to a logical impossibility. Hence Arrow’s properties are morally undesirable.
These points will be clarified below.
Note that people have in practice rejected some of Arrow’s properties. Even those scholars who seem to accept the general claim AGV, accept, a fortiori, the implied inconsistency, and thus in practice drop some assumptions to cope with the real world. Unfortunately, however, the literature has not converged to some agreement on which properties are best to drop. The position of this paper will be to forward the proposition that the Arrow axiom of Pairwise Decision Making (formerly known as the Independence of Irrelevant Alternatives) is the culprit to kill. It is a bad axiom for rational collective decision making, since it appears to be incongruent with that very notion itself.
In the following we develop the concepts, give a short proof and discussion of Arrow’s Theorem, construct the argument against the claims, reappraise the literature, and conclude.
Please note that we will have to redefine some symbols for this chapter only.
Let X be the commodity domain. An element in the
commodity domain can be called an item or a candidate. An agent is a compound
of various properties such as utility, wealth etcetera. Let S be the set
of possible compounds on X. With n agents, our interest concerns
the function c: Sn
S. which maps the society into an aggregate
compound. This is generally called the ‘Arrow type of social welfare function’
or simply a constitution.
A constitution differs from the ‘Bergson-Samuelson type of
social welfare function’ (SWF) - and the latter is defined directly over X as
SWF: X
[0,
).
Arrow’s Theorem concerns Social Welfare Function Generating Mechanisms (SWF-GMs) like the c above. Thus, a constitution can be seen as a mechanism that uses the population as input and generates a SWF that orders all elements in the commodity space. This can be compared to a Social Decision Function (SDF) that selects only one element, namely the best of a budget set. This can be weakened further by considering preference orderings instead of functions. Constitutions generally associate better with SDF-GMs since parliaments generally don’t care ordering all proposals. However, these concepts can be translated into each other via varying the budget set. Since the SWF is the conventional concept in economics, the word “constitution” can remain associated with a SWF-GM.
It suffices to restrict S to preference orderings.
These orderings satisfy reflexivity, transitivity and completeness. It is
important to add that there is no cheating. Let R denote normal
preference, P strict preference, and I indifference. When there
is no confusion, we can also use the symbols
, < and =. A suffix denotes an individual
preference, otherwise it is the aggregate. An element in Sn is
called a profile, and R = c(R1, ...Rn).
There are the following Arrowian axioms:
AWP the weak Pareto principle
AU universal domain (wide ranging preferences)
AD no dictator
APDM pairwise decision making (the axiom
f.k.a. independence of irrelevant alternatives)
a AWP & AU & AD & APDM.
The Arrow Theorem can be expressed in various equivalent logical forms:
AT a
falsum
AT’ a
~a
AT” ~a
AT”’ (AWP & AU
& APDM)
~AD
with falsum a contradiction or falsehood and ~ the negation sign. If something leads to a contradiction, then we conclude to the falsehood of the assumptions themselves.

There is a Kantian distinction between technical, pragmatic and moral (categorical) imperatives. Utility, as commonly regarded by economists, likely is of the pragmatic kind. Interestingly, theorists on morality have developed something called ‘deontic logic’, which appears to give many similar results as economic theory. Deontic logic however applies to propositions and not to commodity domains. It is possible, though, to integrate all these kinds of preferences into an integral utility index, when we replace a point x in the commodity domain by a statement “The state of the world is x”. This integral utility index likely would be lexicographic, in that some moral and constitutional issues might dominate pragmatic results in the commodity domain. Thus, while we would use the same symbols R, P and I, we would need to look into the structure of the index to find the Kantian distinction as made by the particular agent. We conclude that we can usefully introduce and apply some terms from deontic logic. Define:
Ap
(~p
p) means that p is allowed
(at least as good as ~p)
Op
(~p < p) means that p is
a moral obligation (one ought to p)
An exemplaric deontic result is:
Op
~(A(~p))
Deontic logic allow us to translate:
AMC = Oa
The use of deontic logic allows a forceful restatement of Arrow’s difficulty in social choice:
Oa & ~a

Let us consider some more properties of morality and deontic logic.
The gap between Is and Ought (Sein und Sollen) means the rejection of
p
p
Op (‘If
something is, then it should be like that’) and, in principle,
p Op
p (‘what ought to be,
is achieved’).
Note what this actually means. A statement p has a
truthvalue 1 (true) or 0 (false), depending upon the state of the world. A
statement Op has a ‘truthvalue’ 1 (ought) or 0 (not-ought) depending
upon one’s preferences. Applying the logical calculus for the propositional operators
, ~,
, & thus is a
mental exercise, where empirical and preferential statements are first given
the common denominator of ‘accepting as valid’. Also, it may be that in one case
both p and Op are accepted, but the rejection of
p p
Op means that it is
rejected as a rule. [105]
Moral consistency is reflected in the Deontic Axiom:
DA
p,q (Op &
(p
q))
Oq
There is some discussion between moral theorists whether DA
really holds. It may be felt that the logic is not very compelling for
empirical relations of dubious causality. However, if p
q reflects a logial
truth, then DA is commonly accepted.

On reasonableness, it seems a bit better to attach the properties to the agents rather than to the propositions or commodities. Useful axioms then are:
AF feasibility, X is the budget set (rather than the whole space)
ARe agents are realistic (they only consider feasible options, accept AF)
I thus agree with Arrow’s 1950 statement: “My own feeling is that tastes for unattainable alternatives should have nothing to do with the decision among the attainable ones; desires in conflict with reality are not entitled to consideration.” Thus, also, when one point is (socially) most preferred, it is the one consumed.

The most complex property seems to be good old rationality. It appears that we better introduce the information set or knowledge base I(.) and state the condition that it must contain the Arrow Theorem. Then:
ARa agents are rational (they accept logic, [106] have a preference ordering, are morally consistent (DA), and are educated on Arrow’s Theorem (I(~a)))
The I(~a) condition is a novel aspect, that, however, should not come as a surprise, given what we said in the introduction. There is a difference between a learning process and a result. In a common classroom or used-car-salesman strategy, people are goaded into buying some axioms as reasonable and attractive, and then burn themselves, which teaches them. This may be called rational from the viewpoint of learning. This paper however concentrates on the after-learning-rationality, the kind of rationality that makes learning so worthwhile.
How does Arrow’s original approach relate to the inclusion of I(~a) ? Arrow (1950, 1951, 1963) has no incorporation of learning - though he later has written on ‘learning by doing’ - so it might be that he assumes standard economic rationality. If that would be perfect foresight, then I(~a) is implied. However, it is better to hold that Arrow in that period discussed constitutional choice for agents and not by agents. The choice for people then is made by some algorithm or calculating machine. His axioms do not describe educated people involved in constitutional choice. Alternatively put, another new result in this chapter is the widening of the scopes of utility and rationality to the inclusion of knowledge about the constitutional process itself. In that sense the original Arrowian axioms can be called incomplete. Alternatively, if the idea is that these axioms concern educated people, then there is a hidden inconsistency, in that reasonable agents are assumed to regard inconsistent axioms as reasonable. [107]

Hence:
ARC = ARe & ARa
It appears very useful to discuss the example given by the Marquis de Condorcet 1785. Sen (1970) gives a simple example that appears to be presented first by Nanson 1882. A similar example is reproduced in Table 12, and I will refer to it as “the Condorcet case”. There are three parties and three topics A, B and C on ballot, and the numbers of seats and the preferences are such that, with pairwise voting and a majority rule, a cycle results: A < B < C < A.
|
Party |
Seats |
|
Topics ordered by preference |
|
Pairwise vote |
|||||||||
|
|
|
|
Low |
Mid |
High |
|
A |
B |
|
B |
C |
|
C |
A |
|
Red |
25 |
|
A |
B |
C |
|
|
25 |
|
|
25 |
|
25 |
|
|
Green |
35 |
|
C |
A |
B |
|
|
35 |
|
35 |
|
|
|
35 |
|
Blue |
40 |
|
B |
C |
A |
|
40 |
|
|
|
40 |
|
|
40 |
|
Total |
100 |
|
|
|
|
|
40 |
60 |
|
35 |
65 |
|
25 |
75 |
|
B |
C |
A |
It is, in all clarity, not that easy to aggregate votes on more than two topics. [108] For two topics one can indeed ask for pro and contra, and find a majority (and occasional ties, for which exist tie-breaking rules). For two topics one can indeed ask for pro and contra, and find a majority (and occasional ties). For more topics, votes will scatter across the topics, and there will often be no clear majority. Therefor, pairwise voting is a good strategy to get the required information on the preferences. However, pairwise voting apparently also causes problems. So, basically, the search is for a strategy without such problems. And that is, basically, also the suggested value of Arrow’s Theorem: that it states that there would be no such good strategy.
However, in this Condorcet example, we may clearly conclude that the cycle primarily means that there is a tie. The situation is in a deadlock, and the group, as a collectivity, is indifferent. That there are indifferences or ties, is nothing special. Standard economic analysis allows agents to be indifferent (we even draw indifference curves), so groups should be allowed to be indifferent too. In Condorcet’s example, indifference is even a logical choice, since when we assume something else, then we quickly run into difficulties.
There is the famous case of Buridan’s Ass (AD 1358). A donkey stands between two equal stacks of hay, at equal distances. He cannot decide which stack to take, and dies of starvation. The upshot of this parable is that rational beings can devise a decision. Constitutions generally state what happens when there are ties. Commonly the Status Quo persists. (This may happen even if it was one of the topics under ballot, and apparently was rejected at that stage.) Alternatives are that the chairman decides, or points are (re-) negotiated, and one can use dice.
It is important to see the difference between voting and deciding. In two stages, the chairperson first lists the votes, and then only secondly gives the decision with a tick of the hammer. Table 12 essentially gives a voting field, and no decision yet. There is no inconsistency as long as we record these results as voting scores, for example “B has more votes than A in a pairwise comparison”. There only arises an inconsistency when we change this into a preference, i.e. decide that “B is better than A”. There are additional rules that translate the field into a unique decision. Part of paradoxical element in voting derives from confusing voting and deciding.
We can use Condorcet’s example to give a short proof of Arrow’s Theorem, restricting our attention to majority voting.
Proof: The group decision in the Condorcet case is indifference, so that B = C. Under the axiom of universality we can look at various preference profiles, of which Condorcet’s example is only one. Now regard the adjusted profile such that the preferences on B and C remain the same, but the preference on A drops to the lowest position. The new profile thus is {A < B < C, A < C < B, A < B < C}. Since the preferences on B and C have not changed, the APDM outcome on B and C should be the same. Majority voting now however results into B < C which differs from B = C. Contradiction. Thus there is a counterexample to the axioms. So the axioms are inconsistent. Q.E.D.
The merit of this short proof is that it clearly shows the awkwardness of the APDM. In the case of Condorcet’s example the conclusion B = C is a sound decision, and in the case of the adjusted example the conclusion B < C is sound too. That preferences outside of the pair B and C have changed is vital to the group decision, since the shift helps a change from clear indifference to clear preference. The preferences on other topics are quite relevant, and not ‘irrelevant’. APDM excludes vital information about the preferences - to be precise: it destroys information that exists - and it should come as no surprise that paradoxes and inconsistencies arise. The APDM is incongruent with the notion of group decision making. Perhaps an individual can exclude information about other topics, but a group cannot. (Or a brain that works as a group cannot.) It is a surprise that APDM has not been killed right in 1951.
Arrow (1951, 1963) introduced an axiom “Independence of Irrelevant Alternatives” (AIIA) that has caused much misunderstanding. That axiom here has been baptised the “Axiom of Pairwise Decision Making” (APDM). Thus the axiom remains the same, only the name is different. The new name is much clearer about what the axiom really means in normal English.
Since the name “IIA” is so entrenched in the literature, this change of name requires some explanation. The explanation is along the lines:
· There is the distinction between voting and deciding.
· Items that cause cycles cannot be called ‘irrelevant’ for decision making.
· The criterion to separate the relevant items from the irrelevant ones is rather the budget and is not necessarily found in pairwise voting for all items.
Arrow's axioms on using the whole commodity domain and universal preferences introduce the possibility that we might also be obligated to consider farfetched items. Arrow introduced the APDM to limit this effect again, since it allows that a decision on our current issues can be taken independently from other farfetched possibilities. It is reasonable that people neglect farfetched possibilities. Thus Arrow on one hand opens the door wide for such farfetched possibilities, and on the other hand introduces a strict condition that kills the relevance of this. The whole looks reasonable, since people in fact neglect farfetched possibilities.
Yet, the whole does not conform with the practical situations in Parliaments, where the problem is defined for existing voters and where the issues on table are given by the budget set.
Thus, (a) the notion of ‘irrelevance’ is dealt with by considering the budget set, (b) the axiom can be named after what it properly does: pairwise decision making.
If we want to deal with possibly farfetched preferences of some citizens, which is the moral meaning of the axiom of universal preferences, then we should work towards practical procedures that work. Assuming inconsistent axioms is not a good way to deal with that moral question.

The following sections use formal logic.
Lemma A.I: AF implies that a constitution p satisfies the property Op
p.
First proof: AF
means that desires (Op) in conflict with reality (~p)
are not entitled to consideration. But
p ~(Op & (~p)) is equivalent to
p Op
p. Q.E.D.
Second proof: We already concluded that the most
preferred point (Op) would also be the chosen point (p). Thus
p Op
p. (If the point is
not preferred, then the implication is true ex vacuoso.) Q.E.D.
Discussion: We have enlarged the commodity domain with constitutions, and hence the axiom of feasibility becomes a bit stronger. The extension itself is rather weak, since we only extend on consistency (and not empirical validity). Our criterion is as that a reasonable society would stick to its rules. The gap between Is and Ought still exists in principle, but can in practice be bridged by the human effort to attain one’s ends.
Theorem A.1: For a reasonable society, the AMC is invalid.
First proof by rationality & moral consistency (DA):
Assume Oa. But a
~a, and with DA we get O~a. But this gives
a preference inconsistency Oa & O~a. Hence ~Oa. Q.E.D.
Second proof by rationality & moral consistency (DA):
Assume Oa. Since a
falsum we find Ofalsum. Thus for some p0
we have O(p0 & ~p0). But
this means Op0 & O~p0, and that
is a preference inconsistency. Hence ~Oa. Q.E.D.
First proof by realism (AF): Assume Oa. By
the lemma
p
Op
p we
find a. But then we have ~a & a, which is an inconsistency.
Hence ~Oa. Q.E.D.
Second proof by realism (AF): Since ~a
and above lemma ~a
~Oa, hence ~Oa. Thus the axioms are not morally
desirable either. Q.E.D. Note: q
p is equivalent to ~p
~q, and we may take q
= Op.
When the axioms would be morally desirable, then the derived contradiction would be morally desirable - but nobody can be asked to do the impossible. Hence the axioms are not morally desirable. This is a seemingly simple reasoning scheme, but destructive to the accepted view.
Theorem A.2: For a reasonable society, the ARC is invalid.
Proof: Given AF, infeasible choices are not considered. Since ~a, apparently a is not feasible, and the Arrow constitution is not reasonable. So it is invalid that the axioms would be reasonable. Q.E.D.
Discussion: As we stated above, we have enlarged the commodity domain with constitutions, and hence the axiom of feasibility becomes a bit stronger. The extension itself is rather weak, since we only extend on consistency (and not empirical validity). But the conclusion is strong. No reasonable society in its right mind would want to accept Arrow’s axioms as its constitution. Supposedly at a chaotic Boston Tea Party a constitution c = a might be tried, but pretty soon rational people would see that they should make another constitution, for otherwise the situation will remain chaotic, and the Tea Party will not go down into history as a notable event.
Note that Arrow adopts feasibility, but also wants to impose infeasible conditions.
When Arrow’s axioms would be reasonable, then they would have to be consistent as well. However, they are inconsistent. Thus they are not reasonable. This seems a rather simple scheme of reasoning, but it destroys the impact of the Theorem.
For the axioms, there is the subtle difference between ‘reasonable’ and ‘seemingly reasonable when considered by itself’. The following is a good analogy. For a bicycle we want round wheels for when it rides. For a bicycle we also want square wheels, so that it does not fall when it stands still. But there are no round squares ! Ergo, conditions that seem reasonable by themselves, create something impossible and decidedly unreasonable when combined. To conclude ‘there is no good bike’ would however be absurd. Admittedly, it is a good teaching method to first convince students that something would be reasonable, and then have them derive a contradiction. As with the buying of a bad second-hand car, the students learn to be careful, and they learn a respect for science and the value of modesty. This teaching method however overshoots when people remain believers of the reasonableness of the assumptions - as apparently happened with the assumptions of Arrow’s Theorem. A paradox is only a seeming contradiction. Thus there must exist a system that we are willing to accept as the optimal one.
Many mathematicians have been sensitive to the distinction between ‘reasonable’ and ‘seemingly reasonable when considered by itself’, but the literature also abounds with instances where this distinction is not applied with sufficient care. Part of the accepted view thus is a case of bad communication of the incrowd with the larger public. (Given above quotes, the incrowd however might be small. Quis custodet custodes ?)
The selection of the culprit axiom is straightforward. We order the axioms by preference, for example AD > AWP > AU > APDM. From ~a, we conclude that we have to drop one of the axioms. We drop the least preferred one. My discussion on Condorcet’s example should generate support for the rejection of APDM. Basically though, scientists can only advise on preferences, and the proper decision is up to the body politic.
Lemma A.II: If all agents have a > APDM then, with AWP, society has [AU, AWP, AD] > APDM. Note: here [x, y, z] means the unordered set.
Proof: obvious.
Discussion: When all people put AU, AWP and AD in any individual order, but all would have APDM below these, then society can reject APDM unanimously. In fact, the condition AU might as well be regarded as part of the definition of a SWF-GM, and similarly, AWP could as well be regarded as part of the definition of the notion of collective preference. So the real choice concerns AD and APDM, or between dictatorship or not.Here a selfish dictator and his associates would have ¬AD > APDM > AD. The Jorgenson quote suggests his preference for a benevolent and non-selfish dictatorship, but, also since such dictatorships tend to turn sour, my impression is that he would eventually be an associate of a real dictator. Most likely, he did not understand the situation when the quote was printed.
Note that ordering the axioms means that the deontic predicate O is not homogeneous. This means that deontic logic may be more related to preference theory than deontic theorists think.
Consistent constitutions violate one of the axioms of Arrow’s Theorem. Violating one of these axioms is to be considered useful for reasonableness and morality, rather than the reverse. (That is what we proved above.)
One general feature is a Status Quo that persists when there are ties.
One example already has been mentioned in the discussion of the Condorcet problem. With majority voting, a cycle means indifference, and there are various ways to solve ties. One possible solution is the persistence of the Status Quo.
Another example constitution is the “Pareto-Majority” rule. One first selects all Paretian improvements from the Status Quo. That is, those points where some advance while nobody loses. There may be more Paretian points, such as B > A and C > A, with the Status Quo as A. When there is no Paretian order between B and C, then it suffices to decide on these points by simple majority. Of course, with more than two points, majority voting can result into cycling, but that again means indifference, which could be settled by dice, by the chairperson, or by other creative ways.
See my home page and The Economics Pack for implementation of these rules in the program Mathematica. Little helps so much as a trying it out for yourself.
Our discussion arrives at a conclusion that differs from the literature, and thus warrants a reappraisal of that literature. This reappraisal is not the topic of this paper, but some examples are useful.
(1) Note that the Tobin quote above was misleading. The problem with ‘unborn generations’ should not be mixed up with the Arrow difficulty. The Tobin problem actually can have a rather simple solution. It are the preferences of the currently living that matter, and what they prefer for the future unborn (which can also be based on a forecast of such preferences). These future preferences cannot logically be included, since they don’t exist yet.
(2) Arrow 1951 also stated:
“If consumers’ values can be represented by a wide range of individual orderings, the doctrine of voters’ sovereignty is incompatible with that of collective rationality.”
This is clearly inaccurate. The statement suggests that we have to adopt Arrow’s axioms, while the sensible thing is to reject these axioms and to adopt both voters’ sovereignty and collective rationality.
(3) One of the more interesting points made here is the distinction between the learning process and the end result. How should Arrow’s result be presented in the future ? Is it possible to maintain the teaching strategy to call the axioms ‘reasonable’, then have the students get into a fixture, and them let them find a way out ? It is good teaching practice ! However, in a Palgrave meant for a wider audience (or a general encyclopedia that even might be read by dictators), it might be improper to call Arrow’s axioms ‘reasonable’. It should be ‘seemingly reasonable’ at the least.
Note that the phrase then becomes less enchanting:
‘there is no social choice mechanism which satisfies a number of seemingly reasonable conditions’.
(4) I am a bit shocked by Mueller’s (1989, p406-407) discussion of Arrow’s general view. One would expect a more critical attitude, but finds instead:
“The Arrow and Sen theorems (...) raise fundamental questions about the possibility of establishing collective choice procedures satisfying minimally appealing normative properties (...) But the negative side should not be overemphasized. We have suggested that both sorts of paradoxes might be avoided with the use of cardinal, interpersonally comparable utility information. Arrow explicitly eschewed the use of such information, and the independence of irrelevant alternatives [thus Pairwise Decision Making / TC] axiom was imposed to rule out voting procedures that might make use of such information (... But it) is possible that the citizens may be trusted to make these comparisons in an ethically acceptable way.”
Well, interpersonal comparison of course occurs, minimally, when we assign votes to people, assign rights to put topics on ballot, and the like. So interpersonal comparison is not as bad as many economists seem to think. But my solution to Arrow’s difficulty does not rely on cardinality and cardinal comparison. So, disappointingly, Mueller both accepts the idea that Arrow would cause ‘questions’ about the possibility of social choice, and he comes with a wildly wrong conclusion. This is supposed to be a modern textbook !
(5) What is important, is that the development of economic theory and the development of real economies have been hindered by the confusion generated by the standard explanation. Where decision makers were divided, some interested in social welfare and others not, the latter group was provided with decisive gunpowder - and beware of people who have an ideology and even wield a mathematical theorem to prove their lunacy. Generations of students have been taught by Nobel Prize laureats that research into social welfare would be subject to impossibilities. Creative energy has been directed to enlarging the impossibilities rather than to devising structures that might improve practical situations. Practical research into social choice functions and parameters has been aborted, all with reference to a misunderstood theorem !
Economic research also leads to a suggestion of a constitutional amendment, see Colignatus (1996b) and the appendix. I hope that this present chapter helps to clarify that this kind of research is a useful type of economics.
(6) This analysis also clarifies a confusion about the relation of constitutions to the SWF. While many economists argued that constitutions could not be reasonable or morally acceptable, they did accept the Bergson-Samuelson SWF, even though the latter was derived from the former - and nobody seems to care about this inconsistency. Which is now removed, since the properties of the constitution are projected into the SWF.
(7) It is relevant to note that I gave this analysis earlier, in Colignatus (1990c, 1992a). This chapter is almost 99% the same as 1997b, and a a rephrasing of the main principles. I have had no success so far in getting a publication, neither at the CPB nor in a journal. [109]
Arrow’s Theorem has given some problems in the literature, see the quotes above. We have achieved the following solution:
·
There is more clarity now, by the distinction between the theorem
proper (a
falsum), the moral claim (Oa) and the claim on reasonableness (AF
and I(~a)).
· The arguments above on rationality and morality have a destructive character since they reject the accepted view. In another perspective they are constructive, since they allow the formalisation of (meta) notions, and bring these back into mathematics again (notably the voting on constitutions).
· From a mathematical point of view, the Arrow axioms are incomplete for decision making in a reasonable society.
· It has been shown that the APDM is undesirable. Dropping APDM is not a sad state of affairs, as is sometimes suggested in the literature, but a sign of understanding group decision making.
· The Arrow axiomatisation does not capture the truly desirable properties required for a constitution, both by incompleteness and APDM.
· There are detail results, such as the distinction between voting and deciding, the integration of preference theory and deontic logic, and a proof of Arrow’s Theorem that shows clearly the abuse by APDM.
· We have given examples of consistent constitutions that many might regard as optimal.
Sen (1999a:250-253) contains a short summary discussion on his view on the Theorem. First I quote him and then give my comment. Sen states:
“The Arrow Theorem does not in fact show what the popular interpretation frequently takes it to show. It establishes, in effect, not the impossibility of rational choice, but the impossibility that arises when we try to base social choice on a limited class of information.”
This is not correct. Using the information provided by pairwise voting results, we can decide to a tie (deadlock, indifference) when such might arise. It is the adoption of the APDM axiom that, wickedly, turns this indifference into an inconsistency. The APDM does not mean lack of information, it only corrupts the information that exists.
“At the risk of oversimplification, let me briefly consider one way of seeing the Arrow theorem. Take the old example of the “voting paradox,” with which eighteenth-century French mathematicians such as Condorcet and Jean-Charles de Borda were much concerned. If person 1 prefers option x to option y and y to z, while person 2 prefers y to z and z to x, and person 3 prefers z to x and x to y, then we do know that the majority rule would lead to inconsistencies. In particular, x has a majority over y, which has a majority over z, which in turn enjoys a majority over x. Arrow’s theorem shows, among other insights it offers, that not just the majority rule, but all mechanisms of decision making that rely on the same informational base (to wit, only individual orderings of the relevant alternatives) would lead to some inconsistency or infelicity, unless we simply go for the dictatorial solution of making one person’s preference ranking rule the roost.”
Locating the problem in the informational base is erroneous. Clearly, majority decision does not lead to inconsistencies, for it is the use of the APDM axiom that does so - and we don’t need it for majority decisions. The Arrow Theorem does not show that there are inconsistencies for all mechanisms - we namely can use mechanisms without APDM.
“This is an extraordinarily impressive and elegant theorem — one of the most beautiful analytical results in the field of social science. But it does not at all rule out decision mechanisms that use more — or different — informational bases than voting rules do. In taking a social decision on economic matters, it would be natural for us to consider other types of information.”
I don’t know about “extraordinarily impressive and elegant”. Condorcet came up with his paradox, as earlier people came up with paradoxes when dividing by zero, as Bertrand Russell had his set-paradox, and as the Cretian Epimenides said “All Cretians are liars.” Arrow’s Theorem solves the Condorcet paradox by showing that we must not use APDM - though Arrow apparently did not realise that. The theorem is basic, and we must be glad that we have it, as APDM apparently can cause a lot of confusion, as the last 50 years have shown.
“Indeed, a majority rule — whether or not consistent — would be a nonstarter as a mechanism for resolving economic disputes. Consider the case of dividing a cake among three persons, called (not very imaginatively) 1, 2, and 3, with the assumption that each person votes to maximize only her own share of the cake. (This assumption simplifies the example, but nothing fundamental depends on it, and it can be replaced by other types of preferences.) Take any division of the cake among the three. We can always bring about a “majority improvement” by taking a part of any one person’s share (let us say, person 1’s share), and then dividing it between the other two (viz., 2 and 3). This way of “improving” the social outcome would work — given that the social judgment is by majority rule — even if the person thus victimized (viz., 1) happens to be the poorest of the three. Indeed, we can continue taking away more and more of the share of the poorest person and dividing the loot between the richer two—all the time making a majority improvement. This process of “improvement” can go on until the poorest has no cake left to be taken away. What a wonderful chain, in the majoritarian perspective, of social betterment!”
Remember that Sen writes this book for a general audience of economists who will not have gone deeper in social choice theory. Though Sen now relates basic truisms, his reasoning nevertheless is a bit off. Indeed, Western democracies tend to have property rights and a “status quo” rule, and a Madisonian philosophy that democracy actually exists to protect the minorities. We use all kinds of additional information, in order to settle problems of fairness and equity. Thus the majority rule is not suggested for the raw form that Sen uses as an example. Then, crucially, when Sen suggests that this example clarifies that we must use more information to solve the Arrow paradox, then this is a non-sequitur. His argument becomes seductive, since the reader is seduced into thinking that, indeed, we use more information. But the truth is that we use this additional information to solve equity matters, and not to solve the Arrow inconsistency.
“Rules of this kind build on an informational base consisting only of the preference rankings of the persons, without any notice being taken of who is poorer than whom, or who gains (and who loses) how much from shifts in income, or any other information (such as how the respective persons happened to earn the particular shares they have). The informational base for this class of rules, of which the majority decision procedure is a prominent example, is thus extremely limited, and it is clearly quite inadequate for making informed judgments about welfare economic problems. This is not primarily because it leads to inconsistency (as generalized in the Arrow theorem), but because we cannot really make social judgments with so little information.
“Acceptable social rules would tend to take notice of a variety of other relevant facts in judging the division of the cake: who is poorer than whom, who gains how much in terms of welfare or of the basic ingredients of living, how is the cake being “earned” or “looted” and so on. The insistence that no other information is needed (and that other information, if available, could not influence the decisions to be taken) makes these rules not very interesting for economic decision making. Given this recognition, the fact that there is also a problem of inconsistency—in dividing a cake through votes — may well be seen not so much as a problem, but as a welcome relief from the unswerving consistency of brutal and informationally obtuse procedures.”
Sen is aware that his reasoning is not strict (vide his use of “primarily” and “also”) but, still, he makes the suggestion, which is erroneous.
Indeed, the spirit of “impossibility” is not, I believe, the right way of seeing Arrow’s “impossibility theorem.” [footnote] Arrow provides a general approach to thinking about social decisions based on individual conditions, and his theorem—and a class of other results established after his pioneering work — show that what is possible and what is not may turn crucially on what information is taken into effective account in making social decisions. Indeed, through informational broadening, it is possible to have coherent and consistent criteria for social and economic assessment. The “social choice” literature (as this field of analytical exploration is called), which has resulted from Arrow’s pioneering move, is as much a world of possibility as of conditional impossibilities. [footnote]”
This quote just repeats the error - and adds a string of perceptions to sweeten the cake. The footnotes are references to his “Collective choice and social welfare”, his Handbook contribution and the Nobel lecture, Sen (1999b), and add no news, for us, to the essence discussed here. Indeed, the obviously relevant Nobel lecture just repeats the error.
Hence, Sen basically does not understand the problem. I do value his work on social choice since it was a useful guide to me in making Arrow’s result accessible, and in seeing the various perspectives of it. As Newton is reported to have said: “Standing on the shoulders of giants, we can look further.” I cannot wait till Sen writes me that he enjoys my solution !
Andreu Mas-colell, Michael Whinston and Jerry Green ’s 1995 “Microeconomic Theory” is just wonderful. A great book. Generally speaking, though, since they erroneously write: “Either we must give up the hope that social preferences could be rational in the sense introduced in Chapter 1 (i.e. that society behaves as an individual would) or we must accept dictatorship.” (p780). And the subsequent discussion indeed leads the student in the bogs and misdirections so typical of 20th century ‘social choice theory’. The math is OK, but concerns something like the question of how many angels can dance on a pin’s head - and the whole induces the student to become wary of social decision making. (To be sure: I appreciate the other qualities, and have used the book for sections of my Economics Pack.)
Theory shows that voting is subject to paradoxes, while it also appears that a voting result is caused as much by the procedure as by the voters’ preferences. From a moral point of view, the choice of the procedure then is the major issue. A key insight is that morality presumes time. In a static world everything is given and there is no place for individuals who have to ponder their moral choices. The real world is dynamic however and the most challenging voting paradoxes concern budget changes. The paper develops a new “Borda Fixed Point” mechanism that provides a better protection to surprises by such budget changes. Under dynamics, Donald Saari’s argument on symmetry is less convincing.
The currently accepted view is sometimes expressed as that ‘there is no ideal voting scheme’. The former chapter destroyed that view. There is no mathematical reason to think that such an ideal cannot exist. Since Arrow’s axioms must be rejected, they do not form an ideal. An ideal still can exist, but apparently it is different than originally thought. Perhaps people have different ideals, but then the non-existence of a common ideal derives from empirically different opinions and not from mathematical reasons. Since people can benefit from co-operation, they can still aspire at a scheme that all can agree upon.
Above analysis does not answer the positive question yet what would be a generally good system. The main point here is that everyone should determine this for oneself. Theory can only help to remain consistent. The following is a suggestion for a scheme that is consistent and that could appeal to many.
One important idea is that time plays a role. The basis for this idea is that, abstractly, morality presupposes time. Without time there would be no morality. In a static world everything is given, and there is no place for an individual who has to ponder his or her moral choices. As economists, we can draw static utility functions and isoquants, but those are abstractions, and they might distract from the real moral problem. The moral problem is that now a decision has to be made while the consequences appear later. Afterwards, everything can be explained deterministically (which is the meaning of ‘explanation’), and by hypothesis, determinism will also hold for the future. Yet, in the mean time forecasts are imperfect, there is fundamental uncertainty, and that creates the possibility of morality (or the illusion of morality).
Economic science is intended to help explain reality. In this reality, we see an evolution of human beings in a social process of natural forces. The basic concept is power, in a continuous process, so that the basic approach uses ratio scales and cardinal utility and not ordinal scales. Other assumptions than cardinality enter the discussion only when the group wants to control power, and for example introduce democracy. A common notion is that economists reject cardinality and interpersonal comparison of utility. However, the concept of ‘one person, one vote’ actually imposes some interpersonal comparison of utilities. Also comparing orderings of preferences implies some comparison of utilities. The proper perspective is rather that cardinality is deficient since people can cheat about their preferences (at least in the current state of technology). The major argument for ordinality is that it limits the room for cheating. If people could not cheat, interpersonal comparison likely would be much more popular amongst economists. The point that ordinality reduces interpersonal comparison thus seems less relevant than the point that cardinal comparisons are unreliable since people can cheat.
For example, when a family goes on holiday and has the choice between Spain or Greece, then little Robby might exaggerate his preference for Greece and say that he might as well die when Spain is selected. When the aggregation of preferences would be cardinal, such a huge negative weight for one option would certainly block it. Imposing ordinality limits the impact of cheating however. In common textbooks on voting theory, cheating comes in relatively late, but it is more adequate to start right away with that notion. The crucial insight is: Arrow’s Theorem and the voting paradoxes are the price that we have to pay in order to limit that impact of ‘stategic’ voting behaviour.
Arrow’s orginal question whether there could not exist a generally good voting mechanism remains a valid question, though. As history has shown, mathematicians are proficient in identifying paradoxes and in deriving new impossibilities, and one will not quickly find a suggestion for a generally good system. But it appears that when we consider the issue of time, then a solution tends to suggest itself. To understand this solution, it is useful to first consider three main contenders, i.e. the ‘traditional’ solutions provided by Plurality, Borda and Condorcet. There are other methods, but their properties are such that they need no consideration here.
In Plurality, all voters have one vote, and the candidate with the highest number is selected. Note the problems with this method. The criterion of ‘highest number’ does not imply that the winner must also have more than 50% of the vote. If this is additionally imposed, then this may require more rounds of voting, and then there is the difficult issue whether candidates have to drop out, and if so, how.
Borda’s method is to let each voter rank the candidates by importance, then assign weights given by the rank position, to add the weights per candidate for all voters, and then select the candidate with the highest value. Note that the method appears sensitive to preference reversal, see below.
Condorcet’s method is to vote on all pairs of candidates, and to select the one who wins from all alternatives. Note that such a “Condorcet winner” does not need to exist. In that case the margins of winning can be used to solve the deadlock - but this increases the sensitivity to who participates.
The following example is taken from Saari (2001ab). Consider a budget of three candidates A, B and C, and let there be 114 voters. When we neglect indifference and use strict preference only, then with 3 candidates there are 3! = 6 possible ways of ranking them. Table 13 contains an arbitrary allocation of those voters over such preferences. The highest ranking candidate gets rankorder weight 3, the second gets weight 2, and the least preferred candidate gets weight 1. In the table we can read for example that there are 33 candidates with preference A > B > C.
|
Number of voters |
Candidates and their rank order weight |
||
|
Sum 114 |
A |
B |
C |
|
33 |
3 |
2 |
1 |
|
0 |
3 |
1 |
2 |
|
25 |
2 |
1 |
3 |
|
17 |
1 |
2 |
3 |
|
14 |
1 |
3 |
2 |
|
25 |
2 |
3 |
1 |
|
Results of the procedures |
|||
|
Mostly preferred |
33+0 = 33 |
14+25 = 39 |
25+17 = 42 |
|
Borda |
230 |
242 |
212 |
|
Pairs: A vs B |
58 |
56 |
- |
|
A vs C |
58 |
- |
56 |
|
B vs C |
- |
72 |
42 |
The different voting schemes result into different decisions:
1) Plurality: Voters give one single vote to the candidate of their highest preference. For candidate A we consider its column, select the rows with the score 3, and add the associated numbers of voters 33 + 0 = 33. And so on. Candidate C gets most votes, namely 42.
2) Borda: The votes are weighted with the rank order weight. De column for A is multiplied row by row with the number of voters 3 * 33 + 3 * 0 + 2 * 25 + … = 230. Candidate B gets most votes, namely 242. (Scores -1, 0, 1 might calculate easier.)
3) Condorcet: Voting pairwise over A versus B, there are 33 + 0 + 25 = 58 voters who give A a higher rankorder than B. Etcetera. Candidate A appears to win from both B and C, and then is the “Condorcet winner”.
This example shows that A, B and C can all be winners, depending upon the method selected. The properties of the methods then are the true issue.
Above still neglects strategic voting. This could be represented by a change in apparent position. How do we evaluate this ? It appears that the Condorcet approach is least sensitive to cheating since in a pairwise vote there is an incentive to express one’s true preferences. Pairwise voting however can be unattractive since there need not be a Condorcet winner, or, when one exists, it may conflict with the preference rankings. One way to solve the complexity of choosing between these methods is to compromise by having a run-off election. The two top outcomes of Plurality or Borda are taken and then subjected to a pairwise vote as in Condorcet. There is one final consideration. Simply taking the two ‘top outcomes’ seems unduly simple, we should consider what these actually are. In France, the election between Chirac, Jospin, Le Pen and others caused Jospin’s votes to scatter over all kinds of smaller parties so that he dropped from the race while he was the Condorcet winner of both Chirac and Le Pen. When we are compromising, we should focus on determining the two main contenders.
Let us reconsider the dynamic process that occurs within an economy. We see that under the influence of time, the budget changes continuously. A voting scheme naturally requires that there is a list of candidates, but one cause for paradoxes is that that list is not fixed. For example, in the Borda vote above, B is selected, but if C decides to withdraw (or gets a heart attack), then we would expect B to remain the winner, but suddenly it is A (see the Condorcet vote A versus B). Remember also the Bush, Gore and Nader case. We could consider a procedure to be better when the choice is less dependent upon changes in the budget.
A way to achieve this is to use the notion of a ‘fixed
point’. For a function f: D
R, for some domain D and range R, the point p
is a fixed point iff f(p) = p. Let us consider this concept for
voting.
Let P be the voting procedure, and let X = {x1, …, xn} be the budget with all the candidates. Let the unrefined winner be w = P(X). Let Y be the budget when w does not participate, Y = X \ {w}. Let the ‘alternative winner’ be v = P(Y) = v(w), i.e. the candidate who wins when the first winner w does not participate. This is not simply the run-off between the winner and the common runner-up, since the selection of the alternative winner requires the recalculation of the preference weights. This alternative winner can be seen as a ‘summary’ of the opposition to w. The scheme is a compromise since the Condorcet pairwise condition holds for the winner and the alternative winner. While these notions are defined with respect to the unrefined winner, we can generalise this to any winner, and in particular to our optimal winner.
An alternative condition for winning in general is the ability to win from one’s strongest opponent. This gives the fixed point condition. Define f(x) = P(x, P(X \ {x})), which is the general function ‘the vote result of x and its alternative winner’. Then w* is the solution to the fixed point condition x = f(x):
w* = P(w*, v(w*)) = P(w*, P(X \ {w*})) = f(w*)
When the unrefined winner w is
not a fixed point, i.e. when the unrefined winner w = P(X)
appears to lose from v, so that w
P(w, v), then the search process
can start again from v.
It appears that this fixed point voting procedure reduces the dependence upon budget changes. There can still be a dependence, but it is not as large as without the condition.
In Table 13, the Borda Fixed Point winner is A. With B the Borda winner, A is the alternative winner when B does not participate, and B loses from A in a pairwise match; starting the search from A, its alternative winner is B, and A wins from B.
More on this can be found in Colignatus (2001). That book has also been intended as a textbook and it developed Mathematica programs for the various voting schemes and data manipulations. Given the complexity of the matter, this working environment has appeared a great advantage.
Donald Saari (2001ab) showed that Borda’s method is the only method that satisfies certain symmetries. His suggestion is that the Borda rule ‘therefor is best’. This argument does not convince by itself since ‘symmetry’ is not by itself a moral category. Dynamics is linked to morality, by the notion that morality presumes time, and thus seems a better angle.
Consider direct symmetry first. Suppose that your preference is A > B > C and that my preference is C > B > A. The direct symmetry consideration is that we might both abstain from a vote and stay home, since our preferences strictly oppose each other. Saari noted too that voting cycles can be catalogued under the mathematical concept of rotational symmetry. His subsequent suggestion is that cancellation should hold for all symmetries for all subsets of voters.
What happens when cancellation of ‘rotational symmetry’ is applied to subsets ? The following is an example by Saari that cancellation isn’t trivial then. In Table 14 there are 48 voters, and B is selected by both Borda and Condorcet. In Table 15, 27 voters have been added who have the mentioned rotational symmetry, with 9 for each subgroup. Now Borda still selects B, but Condorcet, and the Borda Fixed Point, select A. In Saari’s view, Borda satisfies symmetry, and ‘hence’ is the better method.
My reasoning is a bit different. First of all, note that I myself have used an argument similar to that of Saari. In my view, the typical Condorcet situation of three preferences A > B > C, B > C > A and C > A > B results into indifference rather than an inconsistency, and I use this against Arrow’s analysis. So I agree with Saari’s view that such votes cancel. I applaud Saari’s insight that if you apply cancellation for all cycles in all subsets, then the logic is to get rid of Condorcet’s method and to use Borda’s method.
Table 14: Start with 48 voters: Borda B, Condorcet B
|
|
Candidates and their rank order weight |
||
|
Number of voters |
A |
B |
C |
|
20 |
3 |
2 |
1 |
|
28 |
2 |
3 |
1 |
|
Borda weighted total |
116 |
124 |
48 |
|
A versus B |
20 |
28 |
|
|
A versus C |
48 |
|
0 |
|
B versus C |
|
48 |
0 |
Table 15: Add 27 ‘neutral’ others: Borda B, Condorcet A
|
|
Candidates and their rank order weight |
||
|
Number of voters |
A |
B |
C |
|
20 |
3 |
2 |
1 |
|
28 |
2 |
3 |
1 |
|
9 |
3 |
2 |
1 |
|
9 |
1 |
3 |
2 |
|
9 |
2 |
1 |
3 |
|
Borda weighted total |
170 |
178 |
102 |
|
A versus B |
38 |
37 |
|
|
A versus C |
57 |
|
18 |
|
B versus C |
|
66 |
9 |
Secondly, however, my problem remains that there is the phenomenon of budget changes. Note that Saari’s example uses a changing electorate rather than a changing budget. My suggestion is that a change in the electorate would require a new vote, while we would want to avoid that in case of a change in the budget. The Borda method would be best, only when the budget would be really given. When it might change, the application of cancellation to all subsets becomes doubtful, since subsets change. There is a fundamental uncertainty with respect to the future. Consider the following example. At a specific point in time, the population of a nation is given, and thus the vote for a President has a specified budget: the population. But, uncertainty sets in again, when people may withdraw from the race. Only a few actually run. Hence, we might well want a rule to deal with possible changes in the budget. Hence, it is not logically required that we cancel votes for all possible subcycles (also for candidates who are not in the race). Saari is very strong on the argument that when we accept cancellation in one case, then we should do so in all cases. I am more sensitive to the exception: when ‘if one, then all’ does not hold.
Concerning Table 14 and Table 15, my reasoning is - contrary to Saari - that the added votes cannot be neglected. The argument of rotational symmetry breaks down when we compare a winner with the alternative winner - which is a pair - while rotational symmetry requires a third candidate or more. For the pair, the addition has an effect. When we consider unrefined winner B and its alternative winner A, then the added votes are in favour of A and no longer ‘neutral’. While C is important since it shows a cycle for a subgroup of voters, another view is that C could be neglected since it is not a fixed point. Canditate C is a typical example of an irrelevant candidate that can cause a preference reversal in Borda voting. Namely, let us consider Table 15 under Borda voting, and let C decide to drop from the race: then A becomes the winner. The Borda Fixed Point method has been developed precisely to deal with that kind of preference reversal.
Thus, when you select your voting method then you must choose between the properties exemplified by this case. (1) Borda is subject to preference reversal. In the example of Table 15, when C drops out, then there would be switch from B to A. (2) The Borda Fixed Point method still depends upon the voting field. In this example, when 27 voters drop out, then there is a switch from A to B.
The choice basically is whether we attach more importance either to the voters or to the candidates. Saari suggests that the candidates are more important, since he cancels the votes of 27 voters and keeps C in the race. I would say that the voters are important and that candidate C is less relevant. The proper question would be whether the winner is a convincing winner. Of course, C can become an important candidate when we add other voters. But then the argument is that those voters count, rather than C.
Consider the impact of semantics. While it has been a long standing notion that cycles may also be taken as indifference, so that the votes cancel, Saari now rephrases this as rotational symmetry, and he suggests that acceptance of rotational symmetry implies acceptance of it for all cases and subsets. The label might be a common mathematical label, but I have a problem with that label in the realm of morality (and the implied universality). Human beings seem to have biological preference for symmetry, and by labelling something as ‘symmetry’, it becomes more attractive. When discussing the different voting schemes, we should be aware of such effects, and try to focus on what the properties really mean, and we should make a proper distinction between a property that is universal and a property that is dependent upon the situation. Perhaps it might be analysed as the ‘mathematical frame of mind’ that acceptance of a property for one set also implies acceptance for all other (sub-) sets, but my conclusion is that when we look closer, that there is room for more subtlety. Indeed, it might well be that considerations of symmetry apply to the static situation, but that we need other considerations for dynamics.
Another example for this need for subtlety is that the ‘rotational symmetry’ argument breaks down on the status quo (see below).
Saari has also developed an ingenious way to depict voting schemes geometrically. For 3 candidates, this becomes a triangle, and the different procedures can be calculated from that. It appears that these triangles are a good educational tool. However, my experience is that the computer programs (Colignatus (2001) uses Mathematica) are easier to use, since they take away the need for calculations, while they are available for more dimensions and also allow for indifference and not just strict preference. A complex scheme like the Borda Fixed Point also requires more work with the triangle, while in Mathematica it is a simple procedure call. It may be noted that above discussion of the Borda Fixed Point method has been simplified by assuming single winners. In practice, there can be ties, complicating the search, and requiring tie-breaking rules.
Another consequence of the switch of attention from statics to dynamics is the recognition of a status quo.
There appears to exist another wide-spread confusion about ‘majority voting’. This idea is that a majority result would still be democratically valid, even if the winning decision implies a real loss for the opposition. The counter-example is when the majority decides that the minority pays $1 to the majority: this is not necessarily a morally acceptable situation, even though there is a majority. From a moral point of view, each voting scheme should have two rounds: a first round to select the Pareto improving points compared to the status quo, and then a second round to select the winner from those Paretian improvements. The majority rule thus can be regarded as only a tie-breaking rule, namely for the deadlock when there are more Pareto improving points. In elections of persons, the status quo can be a vacancy, and in that respect all candidates could be taken as Paretian. But the Paretian pre-condition cannot be skipped in general.
The Paretian condition may require some subtlety. Consider the family choice for a holiday to Greece or Spain, discussed above. If little Robby considers the holiday to Spain to be a deterioration from the status quo of not having a holiday at all, then there is moral argument to say that Spain is not a valid option to take a vote on. However, if it can be established in a first round that going on a holiday is unanimously a good idea, then Robby has to accept a possible majority decision in favour of Spain and against Greece.
One argument against the selection of Pareto improving points is that people might also cheat about these points. This argument is not convincing, since Pareto improvement is in one’s own interest. Indeed, little Robby might try to veto Spain by saying that he does not want a holiday, and thus he might be trying to bargain to get everybody to accept Greece. However, this ploy can be prevented by having that first round on having a holiday, since if he really wants a holiday anyhow, then he has to show this then. Careful construction of the voting process thus remains an issue.
One of the key problems in voting theory is strategic voting behaviour, better known as cheating. In a scheme like Borda, cardinal utility has already been reduced to ordinal utility, so perhaps we should be lenient and allow voters to maximize their utility from the final outcome by manipulating their vote. But our opinion on this does not matter, since the ballot generally is secret and we cannot stop people from voting strategically anyway. In fact, my Mathematica programs, Colignatus (2001), contain routines for cheating. These are simple routines that assume both full information and that others don’t cheat, since the mathematics of cheating while assuming that others cheat too is rather complex, especially when nobody has full information about the true preferences. Given all this, one surmises that election results do not reflect the true state.
Thinking about these issues gave me an idea that might be helpful to elicit the true state. Suppose that each voter is informed in advance that there is a probability p that the ranking order that is submitted will be used by the election computer for strategic voting. If the voter submits his or her true ranking, then this is rewarded with probability p to improve the election result for that voter, and much better than the voter can, since the computer knows all submitted rankings. If the voter submits a strategically adapted ranking, then this is punished with probability p namely to improve the election result for that false ranking. Likely there is a specific value of p that would generate the most truthful election result. Unfortunately, I haven’t had time to develop this idea.
An election result is ‘as much’ the result of the procedure as of the preferences. Arrow’s Impossibility Theorem is complex and full with paradoxes, but the dependence of morality upon time provides a way towards solution.
There are two key conclusions:
(1) The Pareto condition for the candidates under ballot should not be neglected - i.e. that only those candidates are voted on that are an improvement compared to the status quo.
(2) The Borda Fixed Point can be seen as a compromise between the Borda and Condorcet procedures (on Paretian points), and provides a degree of protection against budget changes.
There is also another conclusion. Voting is complex, and becomes increasingly complex when the numbers of candidates and voters rise (especially when we also include indifference and not just strict preference). Direct election of a President becomes quickly infeasible for the more advanced voting procedures. From this observation we can conclude that it is better to have a proportional parlementary system, so that the elected professionals can use the advanced voting procedures to select the President. This approach of representation also prevents that there is a different electoral mandate for President versus Parliament. Note that the discussion above, on Arrow’s Theorem and the Borda Fixed Point method, considers single seat elections, and not multi-seat elections. But the complexity of direct single seat elections tends to support this conclusion on the overall system of proportional representation and indirect election of the chief executives.
The following notes on ethics are not well developed but the points are useful to observe.
(a) I was struck by Keynes’s quote: “along the line of origin at least, economics - more properly called political economy - is a side of ethics” (Skidelsky (2000:264)). This is a point that is commonly not seen by the general public who associate economics with money, and neither by many economists who don’t appreciate the subject of political economy.
(b) Ethics focusses on survival and the good life (“flourishing”). That is, just like laboratory animals require an optimal environment, humans have their own conditions for flourishing. Csikszentmihalyi (1997), “Living well. The psychology of everyday life”, clarifies the required balance between challenge and competence: too much challenge causes stress while too little challenge causes boredom. The Rasch model, also known in psychology as the item-response model, or the Elo model used for Elo rating in chess, seems to fit the situation.
(c) Colignatus (2003), “On the value of life”, essentially focusses on survival: the lifeyears saved and the allocation over individuals. On the quality of life, the “flourishing”, I only have a rough outline “On the price of health”.
(d) The chapter “Without time, no morality” of course links with the discussion in chapter 19 on determinism and free will, and the general importance of ‘dynamics’ for this book.
(e) There was a seminar by McCloskey on virtue ethics that was illuminating and that I can advise to who has a chance to attend. Smith (1759, 1984), “The theory of moral sentiments”, featured strongly.
(f) A general point in ethical theory is that people aren’t really ‘souvereign consumers’. They grow from dependent children to mature adults to dependent seniors, so that there is always a degree of dependency. Political economy takes this into account. The standard economic approach that assumes souvereign consumers however can still be useful for analysis even while being limited in this respect.
(g) Another point concerns the distinction between ‘rules’ and ‘rhetorics’. In ethics, it does not suffice to have rules only, since these must be applied to practical situations – where rhetorics apply. In law, there are not only laws but also courts. Current literature in economics tends to emphasize rules. If economics had courts too then there might be less imbalance. The suggestion that there be economic courts links with the idea of an Economic Supreme Court.
(h) There are some other advisable books that enrich our understanding of humanity, (social) behaviour, ethics and its biological roots, which form the input for and target of political economy. Tiger (1992), “The pursuit of pleasure”, mollifies the economistic calculus of utility, which at the same time clarifies that it still can be useful to use small abstract (simplistic) models to develop arguments that can improve the lifes of many. Damasio (2003), “Looking for Spinoza”, delves into the brain to understand human emotion and feeling. Though many dimensions exist, there still is the pain and pleasure dichotomy that links to ethics. Damasio also notes that biological ‘emotions’ (generally) arise split-seconds before being reflected in ‘feeling’ in the mind. This phenomenon raises the question of ‘free will’ and the reader is referred to that section in chapter 19 above. De Waal (2001), “Tree of origin”, discusses whether primate behavior can tell us something about human social behaviour, and the same themes arise. Cavalli-Sforza (2000), “Genes, peoples and languages”, focusses on recent human evolution. Diamond (1997), “Guns, germs, and steel”, makes us aware of the impact of mere geography. All these books clarify that political economy can be of value for humanity by keeping an open eye for the study of humanity itself.
(i) Cavalli-Sforza (2000:207) concludes with this statement: “It will be necessary, for example, to be more successful in spreading the necessary moral values to the whole world. Is the amount of deception, hatred, exploitation, and unrestrained selfishness we observe in almost every society inevitable ? We need not be too pessimistic and should admit that people do not always display their worst qualities. But it would be valuable to learn exactly the conditions that elicit these destructive tendencies, in order to systematically prevent them. Overpopulation and extreme competition for valuable resources undoubtedly contribute. Our aptitute for social engineering is limited, although we must become more serious about work in this area, so as to end - or at least reduce - major social ills such as poverty, ignorance, population growth, racism, drug addiction, crime, and other social epidemic and endemic diseases that afflict us. Our efforts in this regard can be helped by studying cultural transmission and the forces of conservatism that hinder useful innovations, as well as the danger posed by promoting and accepting great changes too soon.” I can only agree with this, and the current book fits this objective.
It has been a cause of wonder for the present author why other economists are not more outspoken on the Tax Void, and why above theorem on the possibility of returning to full employment meets such disbelief as it apparently does. In the course of time, I found that the following issue forms part of the explanation.
Many economists think that there are no free lunches. It may even be a dogma or mantra to them. With this general attitude, they close their eyes to the free lunch that presently exists in the inefficient labour market. They adhere to their ‘no free lunch’ philosophy regardless of what arguments other people forward. My diagnosis is that this is one of the reasons why the debate on unemployment is rather stuck.
It actually can be shown that the economy is full of free lunches. We will discuss two examples below, namely the examples of the consumers surplus and economic growth. By regarding these examples we will better appreciate the nature and significance (as Robbins might say) of a free lunch. When the possibility of a free lunch is accepted, then we can discuss unemployment in more realistic terms.
The American science fiction writer Robert Heinlein once created a rough Moon Colony where the rules of the free market are exploited to their limits. In this colony the phrase “Your money or your life” is not a criminal threat but a sound business proposal - and a bargain for many as well. In the same vein all incidents in the novel are subject to bets - and after some consideration, the reader of this novel may well accept this as a useful system of rational contingent forward markets. Then, properly, the slogan & law of this Moon Colony is TANSTAAFL: “There Aint No Such Thing As A Free Lunch”.
TANSTAAFL is rather “accepted wisdom” in the economics profession, and not something that is subject to critical discussion. There are only few explicit statements on the supposed absence of a free lunch. A recent statement is by Cnossen & Van Ewijk (1995):
“No society limited in resources can for a moment proceed from the premise [sic] that there is such a thing as a free lunch. Dispassionate analysis of the problem and hard-headed calculation of the costs of alternative courses of action are called for. This applies especially to the economics discipline, which gives center stage to the concept of opportunity costs.”
So, evidently, in the views of these authors, people disagreeing to their views on this issue are emotional or soft-headed !
Coase (1994:200) has a fine anecdote:
“Charles Walgreen in 1936 withdrew his niece from the University of Chicago because he had been informed that the university taught free love and communism. I know nothing about the university’s teaching on communism but presumably Mr. Walgreen would not have been mollified to learn that the true Chicago view is that there is no such thing as a free love. Eventually, however, Mr. Walgreen was convinced that he had been misinformed (...)”
The British newspaper The Economist (1994b) and the Dutch economist Van Bergeijk (1994) state, in reaction to proposals by Snower, that there would be no free lunch on the labour market. Even with current unemployment, it would not be possible to change taxes, contributions and benefits in such manner that this would raise employment opportunities for the unemployed without other agents having to pay some bill.
These latter authors use arguments for their views. So their judgement does not seem dogmatic. However, their arguments have been refuted. Authors like Snower and myself, and many others, have also pointed to the possibilities for improvement in the labour market, and these arguments have not met with convincing rejections. So it may well be that TANSTAAFL works its ways in the back of the minds and hinders proper balancing of arguments.
We somehow might welcome the Cnossen & Van Ewijk statement, since it makes explicit what often is only implicit. In the following I shall deal with the problem in general. I hope to banish TANSTAAFL to the domain of science fiction, so that thereafter we can discuss the labour market in more useful terms.
The more innocent examples of free lunches happen around us every day. For example, in a free country, a transaction occurs only when both parties get something out of it. TANSTAAFL adepts will hold that when there is a transaction, and people pay for their lunch, then there clearly is no free lunch. However, the theory of the consumers surplus reminds us that you may pay for your lunch, but likely not as much as you might be willing to pay. If you would not get more out of it, there would be little point is actually doing the transaction. In everyday life, we see few people exchanging dollars for dollars, just for the fun of it. So if p is what you pay for your lunch, and if wtp is your willingness to pay, then wtp - p is your free lunch.
One might argue that the TANSTAAFL conjecture properly reads
that p
0.
Thus TANSTAAFL-ists accept that wtp > p, but the point would be that
you have to invest a nonzero amount before you can reap greater benefits. It
would seem to me that the following is the proper reaction to this:
1. We
might accept a definition that ‘no free lunch’ means p
0.
2. However, that definition does not warrant universal truth. Some goods have p = 0, notably endowments, ideas and, in a sense, public goods.
3. So, please then, do not use this mal-definition to kill arguments on the labour market that concern new ideas.
4. And,
please see the point that it may be advisable to define ‘p
0’
‘there are some costs’,
and ‘wtp > p’
‘there is a free lunch’.
In a sense, the discussion might only be about words. But there are also emotional connotations involved, that should cause us to be rather careful in that choice.
Economic growth is another instance of manna from heaven, and also a phenomenon that has been with us since the dawn of mankind.
An invention in one industry will generally have consequences for the entire economy. The industry of origin can seldom claim all proceeds. When the optimal ratio of production factors changes, then prices change. E.g. just by mentioning the possibility of other prices, one signals to the other parties that there is room for discussion. The other parties will use that room, and their knowledge and possessions, to claim part of the economic value of any innovation. Other parties have had no effort in bringing about the innovation, but they consider themselves partners in the industry, they know their leverage, and, thus, exploit it. Their advantage not only concerns the consequences of a better product, but also an improvement of their income position.
In a general equilibrium framework we consider an economy with 400 units of labour and 600 units of capital. The economy produces food and clothing, and a social welfare function (SWF) determines the optimal combination. Here, our SWF will be a Cobb-Douglas function that neglects the distribution of income:
(SWF)
Labour a en capital k are allocated to the food (v) and clothing (k) industries via av + ak = 400 and kv + kk = 600. Industrial output is determined by the production functions. Here we take CES-functions, that have a constant elasticity of substitution between capital and labour:
![]()
![]()
Equilibrium and the optimum are found at 278 units of food and 253 units of clothing, with a distribution of the factors of production of av/ak = 299/101 and kv/kk = 210/390.
The allocation can be shown using two figures. Figure 36 confronts the social welfare function with the Production Possibility Curve (PPC).
Figure 36: Social Welfare and the Production Possibility Curve

The PPC gives those combinations of food and clothing that can be produced with the scarce resources. The choice of the highest possible value of the SWF generates a tangent of a contour of the SWF with the PPC. The tangent gives the optimal price ratio (thus trading ratio) of food and clothing.
Figure 37 confronts the production functions of the separate industries in an Edgeworth-Bowley diagram. The food industry has its origin in the lower left-hand corner, and the clothing industry has its origin in the top right-hand corner. The amounts of capital and labour that are not allocated to the food industry are allocated to the clothing industry. The drawn contour for the food industry gives those combinations of capital and labour that produce the same amount of food. That contour is touched in a tangent by a contour of the clothing industry. The collection of all tangency points is called the contract curve. The tangent drawn here passes through the optimum selected by the SWF. This tangent thus also determines the price ratio of wages and capital rent.
Now we assume that there is an innovation in the clothing industry. This innovation can be of technical or organisational origin, and it causes that the same garment can be produced with a little less labour but a little more capital. To be concrete: the production possibility is discovered that can be stated in the production function clothing = CES[0.2, 0.5]. Is this innovation useful ? The answer appears to be that labour is the factor that is relatively scarce and that this innovation allows its better use, so that welfare can rise to 282 units of food and 269 units of clothing. The allocation of factors of production becomes av/ak = 309/91 and kv/kk = 202/398.
Figure 37: Edgeworth-Bowley diagram for the factors of production

Figure 38 and Figure 39 present the same plots as before so that one may see how the economy changes. The figures speak for themselves. It will be clear that our analysis is comparative statics. How quickly the prices change, and how quickly the agents react, will be a question of dynamics.
Figure 38: SWF and PPC of two situations

Figure 39: Edgeworth-Bowley of two situations

Above model was not perfect but helps us to understand how a free lunch percolates through the economy. It helps us to understand what a free lunch actually is.
In above model, the innovation falls from heaven like manna. The innovation is the free lunch. One may see the tautology: If you accept the model, then there is a free lunch; and you accept the model if you see innovation as a free lunch.
One may hold that above model is incomplete. One would want to introduce a separate R&D sector, and then there will be a balancing of R&D costs and the expected increase in national income. As an economist, I’m very much in favour of developing such models. However, actually doing this only moves the question one station further, and does not answer the proper question. For, it is possible that an economy spends 99% of its resources to R&D, and still does not come up with innovations. Good ideas remain like manna from heaven.
You may hold the view that agents already expect economic growth, so that they will not regard it as a free lunch. This reminds of the attitude of some children of rich parents who expect a rich inheritance and who don’t show gratitude for their daily bread. The point to note, though, is that the concept of a free lunch is not an expectational variable, but one of circumstance. There is a free lunch or not, whatever one expects. Indeed, as another example, our wealth is a cumulation of free lunches in the past. That we don’t experience this as a free lunch anymore, is more a sign that we are spoiled, rather than a sign of our dynastic rationality.
And even if we would design a revised expectational concept of a free lunch: then perfect foresight or rational expectations are only assumptions. There is always the possibility of a surprise idea. The future is uncertain (though predictable) - even though our scientific predisposition is deterministic.
Let me rephrase the point that I want to make here. There are data (exogenes or endowments such as soil, sun, technical relations and the like), the economy depends on the use of these, and the development of the economy can be described in terms of the developments in these data. The data are for free. Ideas are part of these data, and the (major) source of uncertainty. In this terminology, there are free lunches by definition. That is the crux. When economists better deal with their definitions, we get better economics.
Our discussion on the consumers surplus showed that much may be a matter of words. However, using an abstract argument and a concrete small general equilibrium model, we showed that innovation and economic growth are an example of a free lunch for the whole economy. Our intention was to refute the attitude of “there aint no such thing as a free lunch”. Hopefully, this refutation creates more room for discussion of proposals concerning the present immense inefficiency on the labour market. The latter discussion is especially important, since the major proposals for solving the inefficiency concern ideas by impartial economists.
Note 1999: I was afraid that I would clash with Paul Krugman on this issue, since he has a Fortune column ‘No Free Lunch’. To my great relief, Krugman (1999:167) however writes: “And this brings us to the deepest sense in which depression economics has returned. The quitessential economic sentence is supposed to be “There is no free lunch.”; it says that there are limited resources, that to have more of one thing you must accept less of another, that there is no gain without pain. Depression economics, however, is the study of situations where there is a free lunch, if we can only figure out how to get our hands on it, because there are unemployed resources that could be put to work. In 1930 John Maynard Keynes wrote that “we have involved ourselves in a colossal muddle, having blundered in the control of a delicate machine, the working of which we do not understand.” The true scarcity in his world - and ours - was therefor not of resources, or even of virtue, but of understanding.” Hurray!
This discussion will present proper definitions for uncertainty and risk. Such definitions are required since the current definitions in common use are rather erroneous and generate conceptual problems.
The new definitions are - see also Figure 40:
(1) First there is the distinction between certainty and uncertainty.
(2) Uncertainty forks into known categories and unknown categories.
(3) Known categories forks into known and unknown probabilities.
(4) Unknown probabilities forks into assuming a uniform distribution (Laplace) or use non-probabilistic techniques like minimax or neglect.
Note that these definitions only use certainty, knowledge and the distinction about categories (category-uncertainty), and that they do not use the term ‘risk’. Thus an independent definition of ‘risk’ is possible.
A.S. Hornby (1985) “Oxford Advanced Learner’s Dictionary of Current English” defines ‘uncertain’ as: “1 changeable; not reliable: ~ weather; a man with an ~ temper. 2 not certainly knowing or known: be/feel ~ (about) what to do next; a woman of ~ age, one whose age cannot be guessed”. The above fits this.
Figure 40: A diagram of the new definitions

Hornby (1985) defines ‘risk’ as: “(instance of) possibility or chance of meeting danger, suffering loss, injury, etc.” Also: “at the ~ risk of / at ~ of, with the possibility of (loss etc.)”.
Thus, if there are possible outcomes O = {o1, o2, ..., on}, then the situation is risky if at least
one of the o’s represents a loss. The risks are the oi that are losses, thus Risks[O] = {oi
O | oi
is a loss}. The risk factors are the positions or index numbers of the risky
outcomes, the i’s, or the dimensions (the causes that make such positions to be
filled).
We will use the term ‘valued risk’ when a risk is valued with money or utility. When all risks have been made comparable by valuing them, then we can add them, and we will use the term expected risk value for the expected value of the ‘valued risks’. Then, crucially, once these definitions are well understood, then we may also use ‘the risk’ for the expected risk value. [110]
With such understanding, risk will be r = -Ex<0[x] [111] or for short r = -E[x < 0]. [112]
Valued risk deals with the cases when probabilities are known or when unknowns are assumed to be uniformly distributed over known categories. It is not customary to use the term ‘risk’ for unknown categories. For example, it is uncommon to say, or write economics papers about this, that “all our lives are at risk of a suddenly imploding universe, or black hole hitting Earth, or waking up as a cockroaches”. Such real ‘Acts of God’ are commonly neglected. Note though that it still remains possible to say that a situation is risky even though one cannot put a number to it. Above expectation may be indeterminate since one may lack knowledge about the probability distribution or even the categories.
Relative risk is defined as r(t) = t - E[x < t] for some target level t. Risk (or absolute risk) takes t = 0, and relative risk would allow for a different target level. [113]
An interesting application is when x is a stochastic rate of return and r the certain rate, so that there is relative risk r(r) = r - E[x < r]. This relative risk answers the question: What is the probable loss with respect to a target return of r ? Here, r - r(r) = E[x < r] gives the weight of underperformance in the total target return (which weight has to be compensated by probable profits to achieve the target).
Conditional (relative) risk is defined as k(t) = t - E[x | x < t] for some target level t. With respect to rates of return, conditional risk k(r) answers the question: What would one expect to lose with respect to r, if earnings actually underperform and fall below r. Indeed, r - k(r) would give your expected return when actually underperforming.
Conditional risk is related to relative risk by the property that E[x | x < t] = E[x < t] / Pr[x < t]. The probable loss thus is corrected for the probability of the loss. Or, the probability measure in the expectation is corrected so that a density is taken that sums to 1. [114]
In everyday parlance, profit and loss are nonnegative concepts. For example, if the difference between revenue and costs is $-10, then your loss is $10. It is only in mathematical economics that profits are defined as a general profit function such that ‘negative profits’ are possible. To understand risk, we however return to the everyday parlance convention.
Let us have a prospect that can give profit with probability p, and loss with probability 1 - p. We denote this as Prospect[profit, -loss, p]. We call profit * p ‘probable profit’ and loss * (1 - p) ‘probable loss’. Then the following definitions apply:
·
Expected Value =
= p profit + (1 - p) (-loss) = probable profit -
probable loss
· Risk = risk value = expected value of the risks = probable loss = (1 - p) loss
· Risk Ratio = Risk / (ExpectedValue + Risk) = (1 - p) loss / (p profit)
· Thus: Expected Value = p profit (1 - Risk Ratio)
· Risk Probability = cumulative probability of all losses (in this case 1-p)
Risk is the (absolute value of the) down side of a bet. A venture is judged to be risky if the probable loss is large. Note that this notion still is somewhat vague. A probable loss can be large because of the probability or because of the sum of money involved. This vagueness is unfortunate, in some respects, but here is little to be done about it, since this vagueness is inherent in working with probabilities. In fact, this vagueness is an essentially positive aspect of working with probabilities. For, when we have different prospects, then we can order and evaluate them on risk, neglecting differences in losses and probabilities.
Colignatus (1999, 1999a) further develops these notions for simple binary prospects, multidimensional prospects, joint prospects, and continuous probability densities. An interesting application is the ‘Markowitz efficiency frontier’, but now with risk rather than the spread.
The above definitions are proper in the sense that they conform to every day parlance and the definitions provided by Hornby’s dictionary op. cit.. The definitions provided here however differ from the use within the economics literature. First there are the definitions of Knight (1921) that have been adopted widely in economics, as for example in The New Palgrave (1998:III:358). Or it has become custom in finance to associate risk with the standard deviation. And some mathematical statisticians use another concept of risk. Let us discuss these in turn.
The New Palgrave, Eatwell c.s. (1998:III:358), gives the current common view:
“The most fundamental distinction in this branch of economic theory, due tot Knight (1921), is that of risk versus uncertainty. A situation is said to involve risk if the randomness facing an economic agent can be expressed in terms of specific numerical probabilities (these probabilities may either be objectively specified as with lottery tickets, or else reflect the individual’s own subjective beliefs). On the other hand, situations where the agent cannot (or does not) assign actual probabilities to the alternative possible occurences are said to involve uncertainty.”
Indeed, most economic texts use this distinction in this manner (at least, up to now). However, I cannot disagree more. The objections to Knight’s concept are:
(a) Certainty and uncertainty are binary. So, if a situation is not uncertain, then we have certainty, and there is no assigning of probabilities.
(b) If I am uncertain about a situation and assign equal probabilities to all cases - the Laplace suggestion - then according to Knight this no longer is uncertainty!
(c) In Hornby’s definition, the distinction is not between known and unknown probabilities, but the distinction is between events and human thought.
Figure 41 contains a diagram of the objectionable use of terms 1921-2005.
Figure 41: A diagram of the current but objectionable use of terms

The diagram clarifies the inconsistency with the binary character of certainty/uncertainty, the curious treatment of “Laplace”, and the over-use of terms by introducing the term ‘risk’ where there already is the qualification that the probabilities are known.
The finance literature often uses the term ‘risk’ for the variance or spread (standard deviation) of the distribution of the rates of return of investments. This would be an improper use of the term. Suppose that one has a very profitable venture without the possibility of a loss. Suppose that the rate of return of this venture has a large variance, from mildly profitable to highly profitable. Is this a risky venture ? No, not in the usual understanding of the term.
In mathematical statistics, some authors, like Ferguson (1967), define ‘risk’ as ‘expected loss’. However, it appears that they actually regard ‘loss’ as the negative of total returns (i.e. - revenue), so the definition used is -(p profit + (1-p) (-loss)), which is the negated expected value. This use of the term ‘risk’ is inappropriate. My proposal is to use the word “due” to stand for the negative of expected value, so that the standard statistical decision theory (with the game against nature) can be described as minimising due.
I came across Bernstein (1996) “Against the gods”, and found it equally entertaining as his “Capital Ideas”. One comment is that Bernstein indeed emphasises Knight’s and Keynes’s statements on “uncertainty”. My answer to that is, again, that unknown probabilities or even unknown categories indeed are serious cases of uncertainty, so that earlier writers on the subject were right in emphasising that seriousness. However, we should not be tempted to reserve the word “uncertainty” to only those cases. So with all due respect to Knight and Keynes, the definitions provided here are the proper ones.
Wilson & Crouch (2001), “Risk-benefit analysis”, adopt
the same definition of “risk” as discussed here. I saw this only after the
first edition of this book. Since professor Wilson has been teaching on the
subject for decades and his book only collects his teaching material I
apparently only rediscovered what was already clear to him. Perhaps my
presentation is a bit clearer since I use the formal E[.] notation. This
chapter remains useful since it clarifies the confusions from the other
definitions. Where risk is the product of probability and severity, this book
also benefits from the emphasis on this definition, since, where I started to
develop this argument after the Fall of the Berlin Wall in 1989, we have to
deal with a future where there are huge dangers: though with only a small
probability but on balance a relevant risk.
Above we noted that the structural form of western welfare states is quite complicated. We would like to have a more enduring result than awareness of complexity, and therefor we adopt the Definition & Reality methodology. As said, a proposition - as a statement on reality - can be regarded as a mathematical theorem about/within a model of stylized facts. When there is a tautology, we attain truth by definition. So we now (a) restate what we consider to be the stylized facts, (b) define our concepts, (c) develop theorems and proofs, (d) link back to conclusions about reality.
The reduced form that is most relevant concerns the (long run) comparative statics of the regimes of full employment (1950-1970; Japan/Sweden) and unemployment (1970-2005).
This kind of comparative statics should not induce us to think that we abolish dynamics, though. Stagflation has both a dynamic (inflation) and a static or stationary (unemployment) aspect. When we skip proper dynamics and discuss regime switches in which unemployment features as an important switch variable, then Phillipscurve processes are included in the switching process, even though they don’t feature explicitly in the reduced form.
To attain the necessary level of generality, we use a reduced form where the economy is mapped into a model with three types of agents. One type is the net receiver; and two types are net tax payers. Since the latter two points give a line, that single line represents the state of the economy. The regime switch depends upon the choice of tax parameters.
There are regimes of full employment (1950-1970; Japan/Sweden) and unemployment (1970-2005).
In the welfare state, it is more efficient to have full employment. Unemployment causes lower income - not only directly as in old-fashioned capitalism but also, more noteworthy, by the additional benefit burden. Unemployment can have an adverse effect on inflation when it causes a shift of the Phillipscurve.
It turns out that the propositions that are most interesting, from the viewpoint of political economy, do not require continuity, and can be formulated by assuming dichotomous High and Low productivity labour, combined with one class of Benefit recipients. This assumption allows for a reduced form formulation that allows for generality. For expository reasons we can take social subsistence and productivities as purely constant. In the simple mathematical model the dichotomy gives fixed numbers, in actual observation they are subgroup averages which depend upon general equilibrium processes. The benefit level is rather not an average but a threshold, like the surface of the sea at Scheveningen beach. The words Benefit, High, and Low give letters BHL, and this abbreviation may be pronounced - converged upon after many walks - as ‘beachly’.
It is a stylized fact that welfare states are BHL. Checking this requires next definitions.
Here we will redefine variables such as H, Z, b, n etcetera. Also the reduced tax function will be T(.) as opposed to structural T[.]. These redefinitions hold for this chapter 39 and chapter 40 - that together form a reduced form unity.
Definition: Biological subsistence, for survival, is S.
Definition: An
economy is a welfare state iff people without income are not left to
charity, stealing or death, but get a benefit B. The benefit B
has the following properties:
i. the net benefit has the social subsistence level B
S,
ii. people on benefit may not work, [115]
iii. eligible are:
iii-a. permanent benefit recipients (e.g. ‘the elderly’)
iii-b. people able to work but currently unable to earn at least
net B (these people are called ‘the unemployed’).
Remark: it is useful to have category (iii-a) in the model. It introduces a degree of sufficient complexity. When there are levies even under full employment, then it is easier to understand that wrong co-ordination may cause a switch to unemployment. But (iii-a) might count zero people.
Remark: Property (iii-b) has
the effect of a legal minimum wage. It sets a floor in the market. We might
introduce a benefit threshold (for workers) XB such that S
XB < B, but for
expository reasons, we take XB = B.
Remark: The reservation wage effect is as follows. When vacancies with net income higher than B are registered, then the relevant unemployment benefits are simply scratched. This mimics the array of measures needed for continuous reality.
Remark: This definition implies that people working with subsidies in the Swedish/Japanese case are not on ‘benefit’. Such subsidies thus must be accounted differently, basically as part of taxes.
Remark: The black economy (another form of working while on welfare) is neglected. We neglect also the case that some people hate being on welfare, and thus continue working even when their net earnings are below the benefit threshold (S < net earnings < XB ).
Definition: A welfare state is bhl iff it remains meaningful to trisect its membership into the economic classes of Low and High productivity workers and permanent Benefit recipients.
Definition: A welfare state is nonrevolutionary, iff its economic classes and their data are stable across the change of employment regime.
Definition: A welfare state is BHL iff it is bhl and nonrevolutionary.
Remark: Denote High and Low gross productivity as H and L. Note that B is net. Also bhl-ness technically
implies H >> L
B.
Remark: L may be associated with a minimum wage and H with some average income including profits.
Remark: An example of ‘meaningful’ are subgroup subperiod averages.
Remark: Stability can sometimes be found by normalizing, e.g. take subperiod H(t) as the subperiod numeraire.
Remark: A person’s benefit is often related to the former period working wage. However, anything can be clustered into a social subsistence average. People ‘between jobs’ could be taken to be basically in the employed cluster, people with serious unemployment could be in the other cluster. Don’t object that this makes the matter tautological - since that is exactly what we try to do. (We try to find the definitions that make our understanding tautological.)
Remark: A nonrevolutionary welfare state still allows for politics and economic change.
Lemma I: A welfare state is BHL iff there is stability over the regimes for the variables B, H, L and the associated numbers of agents.
Proof: Self evident. Q.E.D.
Remark: The relevant notion is that the change from unemployment towards full employment (or vice versa) does not destroy the productive base of the economy. Instead of taking this notion explicitly, we have taken a stronger property of nonrevolutionarity, that allows, if bhl-ness applies too, to take (approximate) constancy of the variables.
Remark: At first glance these definitions seem self-defeating for the effort to apply the mathematical method to employment regime switches. When 35 million, nowadays unemployed in the OECD, are supposed to find a job, then apparently the policy maker is supposed to be able to judge on the ‘stabilities’ involved. That seems an impossibly strong assumption. We may however remind about the regime switch from 1950-1970 to 1970-2005. In addition, as modellers we discuss equilibrium states of various paths. Also, it is possible to give the variables an incremental interpretation, e.g. take 34 of the 35 (million) as permanently on benefit, and only look at 1 million on the margin (giving “local-BHL-ness”).
Lemma II: For a welfare state, the (apparent) existence of people with a productivity L’< B, does not block the application of BHL-ness.
Proof: Consider the pathological case of people with productivity L’< B, i.e. so low that (in whatever regime) their net market income is lower than B. Take the dentists, who in a regulated market cannot start a practice, and who are very bad at farming in a flowerpot (which could be done with a Cobb-Douglas production function). These people can be treated as:
(1) society is willing to classify them as (iii-a)
(2) like the Swedish/Japanese approach, they may keep on working with some employer subsidy Z; in that case L = L’ + Z
(3) society lowers B to B = S or B = L’, and reconsiders the problem
(4) if regulations are the bottleneck, then changing these regulations redefines ‘given’ productivity L’. Similarly, if Keynesian methods solve unemployment, then only if people’s effective productivity is restored. So the reduced form applies anyhow. (In that case the regulation or lack of a policy measure is a tax in terms of the reduced form, and ‘real productivity’ is higher than L’.)
(5) they get charity, steal or die, and hence there is no welfare state.
Hence BHL-ness implies that these cases can be ‘averaged out of the discussion’ or be left out for expository reasons.
Q.E.D.
Remark: In other words, BHL-ness is sufficient for discussing employment in the welfare state (but not necessarily for other topics, for example, how regulations affect productivity).
Theorem BHL.1: For a BHL economy, both full employment and unemployment are possible.
Proof:
The structure of this proof is, that we determine the accounting equations, find the reduced form tax relations that are implicit in these, and then deduce the critical tax parameters that determine the regime switch.
Looking at the BHL concept, the only possibility for variation is in category (iii-b). The recipients in that class all move together, and thus there are only two regimes (in or out of benefit dependency). Given that gross productivity has been fixed, the only possible variation concerns net income. We assign the term “tax regime” to the possible states in net income. We find, in other words, that these regimes are implicit in the BHL concept. Let t be the index for tax regime 0 (unemployment) or 1 (full employment).
Given BHL-ness, we thus have: t is 0 or 1, and:
b permanent benefit recipients;
h persons with gross productivity H and net N(t);
l persons with gross productivity L << H, and net K(t).
The regimes are characterized by net income conditions K(0)
< B and K(1)
B:
(0) In regime 0, K(0) < B and l are eligible for benefit B, and they don’t work.
(1) In regime 1, K(1)
B and l don’t get
benefit B, and they work and earn L.
On benefit, the welfare rule is strict on not-working, while
by assumption the black economy can be neglected. Off benefit, the l
have no other means of support and thus work, and earn gross L. Since
net income cannot be larger, L
K(1)
B.
In the following equations, personal income y takes values H and L. Relation (1-t) below gives the implied tax system, where the personal tax T(y, t) depends upon personal income y and the tax regime t:
T(H, t)
H - N(t) ;
T(L, t)
L - K(t) (1-t)
Two points share a line. Hence, the tax system can be represented by a straight line, with an intercept and a marginal tariff. These implied ‘parameters’ (actually: reduced form variables) are defined in (2-t), with 2 pairs of 2 equations & 2 unknowns, giving tax exemption X(t) and marginal rate R(t). The line is the reduced form representation, while the statutory system which guides people’s actions could be anything. Each regime gives a set of reduced form lines; our interest concerns the boundary line.
R(t) (y - X(t))
T(y, t)
(2-t)
Relation (3-t) defines national income Y(t), where the personal incomes are multiplied by the numbers of persons involved. Revenues h H + b 0 = h H are regime independent. Depending upon the regime the l bring in L or not.
Y(t)
h H + t l L + b 0
(3-t)
Relation (4-t) states the condition of a balanced budget. National income equals the sum of net incomes after redistribution. The condition may be called “Walras’ Law”.
Y(0) = h H = h N(0) + (l + b) B (4-0)
or h T(H, 0) = (l + b) B
Y(1) = h H + l L = h N(1) + l K(1) + b B (4-1)
or h T(H, 1) + l T(L, 1) = b B
The budget condition implies that the tax ‘parameters’ are
functions of each other. Per regime, a higher exemption means a higher marginal
tariff, and vice versa. The regime switch itself might, but need not, be the
exception. Given that marginal rates R are generally regarded as policy
variables, we solve for X. With X(1)
L:
(4-0) h R(0) (H - X(0))
= (l + b) B 
X(0) = H - (l + b) B / (h R(0)) (5-0)
(4-1) h R(1) (H - X(1))
+ l R(1) (L - X(1)) = b B 
X(1) = (h H + l L - b B / R(1)) / (h + l) (5-1)
There is a set of critical levels of gross income M(t) = M(R(t), t), such that unemployment results iff earnings L are less than M(t). This follows directly from rule (iii-b). This critical income solves from:
M(t) - T(M(t),
t)
B
M(t) = M(R(t), t) = (B - R(t) X(t)) / (1 - R(t)) (6-t)
Under unemployment, the benefits cause additional taxes l.B
which are levied on a smaller tax base. Given that l are unemployed
anyway, the tax exemption X(0) can be lowered, so that the marginal rate is as low as possible. This has the effect that M(0) shifts to the right,
so that the gap between the possible wage L and the wage ‘required for a
decent living’ widens. There is obviously hysteresis, of a ‘catastrophic’ kind.
Conversely, M(1) can range in B
M(1)
L and allow for larger R(1) though
this could have little effect since also X(1) rises (see below).
While these properties apply to the reduced form, the same mechanisms
apparently apply to the structural form too (as they concern the same reality).
Substituting (5-t) in (6-t) gives M(t) as an explicit function of R(t). The regime switch occurs at M(1) = M(RS, 1) = L with switch marginal rate RS and implied exemption XS:
bB - (h +
l) (L - B)
RS =
-----------------------------
(7-RS)
h (H - L)
bB - (hH/L
+ l) (L - B)
XS = L ---------------------------------- (7-XS)
bB - (h + l) (L
- B)
Rewriting conditions K(0) < B and K(1)
B gives:
{L - T(L, t)
< B}
{ X(t)
XS & L < M(t)}
(8-t)
{L - T(L, t)
> B}
{X(t)
XS & L > M(t)} (9-t)
Now consider the regimes, and determine whether they can exist:
Full employment: Given that L > B, it
follows from (9-1) that the tax exemption can be chosen on or above the
critical value XS. Hence XS
X(1) < H. A prime example is X(1)
= B. Hence (iii-b) is empty.
Unemployment: L is given as the market clearing wage for low productivity persons. If X(0) < XS, then taxes on these persons are increased, and their net income drops below B. Given that K(0) < B, they are eligible for benefits, and apply. Hence (iii-b) is not empty.
It has been shown that both cases are possible. Q.E.D.
Remark: This exposition may seem an overly complex translation of the Cohen Stuart 1889 quote (above) to the welfare state situation. The proof might have said “self-evident” after the first paragraph. Given the record of unnecessary unemployment, this author may however be excused for driving the point home. The usefulness of the BHL concept may be, that officials now can report, “we have diagnosed l people on benefit who should be able to earn L > B on the market, so let’s try to find out how we are stopping them from doing so”.
Remark: A more didactic exposition may start with a structural tax relation, e.g. with R(t) replaced by r in (2-t); see for example the Bentham tax. Then one can show that a ceteris paribus reduction of the tax exemption will increase unemployment. Hence, for the return of full employment it is necessary (but not sufficient) to increase income tax exemption - or something from the ceteris paribus part. Then, the second step in the exposition (as we have done here) is to rename the axis into compounded variables (including VAT, regulations, subsidies, excises, charity, etcetera), and then consider (2-t) as the reduced form. Then we find necessary and sufficient conditions. This however only works satisfactorily for an accepted model of a real economy.
Remark: The theorem doesn’t establish that unemployment has only one cause. Various kinds of unemployment have various causes. But, when various causes are mapped into the world of BHL-ness, then the theorem applies. For example, a long term unemployed academic would be categorised as unskilled labour, even though his employed colleagues earn much more. (The BHL concept thus is drastic. The reasons for applying it have been explained elsewhere.)
Remark: The theorem is strongest in the t = 1
t = 0
part. Given full employment, it is easy to mess it up; and it is easy
to see that you can mess it up. The other way around is less obvious. Here,
both the requirement L
B and Lemma II are crucial. For expository reasons those
are sufficient, but not as sharp as they could be. For example, we might accept
a small loss in H(1)
H(0), as long as net N(1)
N(0).
However, even then the analytical structure remains, that productivity L is assumed, so
that it doesn’t come as a big surprise that
employment is possible. This actually is similar to the Arrow-Debreu
setting,
where endowments are assumed, and full employment appears to be
possible. The
modern reader might be inclined towards assumptions that generate the
impossibility of full employment. (See for example the Grandmont (1983)
setting of expectatory mismatch.) However, each impossibility can be
questioned too. It is up to reality what model applies. Stated
differently: the value of above tautological theorem is that it helps us to
understand what is implicit in our concepts, so that we may be more aware in
observing whether these concepts apply. This fits in with our concept of a proposition.
Remark: The reduced form also captures the ‘physical tax’. The lack of infrastructure, machines or tools may ‘tax’ people - and once these have been provided, they could start earning income, and their earnings would, crucially, be larger than needed to pay for the equipment. Economists of course understand this concept of a physical tax - as the lack of efficient capital markets, or the frustration of those by taxes - but the crucial point is the abstract one. When people don’t earn anything, and the economist suggests to abolish some tax, then a listener may become upset, since how can you abolish something that people don’t pay ?
Diagrams help understanding the analysis. Figure 42 shows two tax regimes, T(y, 0) and T(y, 1), characterized by different exemptions X(0) and X(1), and different critical incomes M(0) and M(1). The main difference is net income at L. In regime 0, net income at L falls below subsistence, causing unemployment and higher taxes to pay for benefits.

It can be seen that T(y, 0) is above T(y, 1), or that average tax rates are lower under full employment. On the left section of the horizontal axis, X(0) < X(1). On the right section, since taxes in regime 0 are higher and levied on a smaller tax base, T(H, 0) > T(H, 1). Thus the effect on the average tax rate is clear. The effect on the marginal rate depends upon the numbers. The case depicted here, with a higher marginal rate in regime 1, is only one possibility; but it shows that a higher marginal rate can combine with actually lower taxes.
Chapter 40 showed the technical possibility of full employment for a welfare state. Chapter 34 showed that social choice is feasible, in the sense that there are consistent and reasonable constitutions that society might deem attractive. In particular, there is the example of a constitution that uses the efficiency criterion (Pareto optimality, PO) to select its policy. There still remains one issue to settle. This is the issue of information. Society might have a consistent preference, and consistently prefer full employment above unemployment, but when people don’t know that it is possible, and instead even have theories that tell them that full employment is impossible, then society might still choose for unemployment as the best of all evils. The issue of information already featured in our discussion of Arrow’s Theorem, and now returns for our discussion of unemployment.
We again follow the procedure given by our methodology. We select stylized facts, develop our concepts, deduce results, and link back to reality. We will first construct a subsidiary lemma that is very general and concerns any suboptimality due to misinformation. Then we take our theorem on the possibility of full employment, recognise it as an item of information, insert it, and construct our theorem on the possibility of co-ordination. [116]
Recorded full employment situations may have been caused by ‘chance’. Policy makers in 1950-1970 may have thought that functional finance was effective, while it also was the tax exemption level. A re-evaluation of the history may however also show that leading economic advisers in the 1950s may have been wiser than those of the 1960s.
It remains a stylized fact that much of the subject matter on employment is well-known. For example in Holland, CPB economists Van Schaaijk (1983), Bakhoven (1988) and Colignatus (1990) pointed the way to full employment. The state of knowledge turns out to be part of the model.
There is a Pareto Optimizing Change (POC) iff some advance and none suffer. A change from unemployment to employment need not be strictly POC. Note that we already have resolved that we don’t need high unemployment to keep inflation in check. So the CWIRU is no argument against a POC. There are other clear reasons that pose a problem. First these two:
· Some bureaucrats have plush jobs administrating the unemployed, and would lose their job and sense of power.
· The unemployed would lose their leisure. For some, the combination of low benefit B and leisure might be preferable to work at a higher income.
We can overcome these barriers by going back to basics, i.e. to our definitions. First of all, the bureaucrats are reminded that they are there to serve the public cause (‘res publica’) - and thus they have signed a contract - before they got the job - that they will welcome full employment and raise no anti-POC objections. In the same way, the people on the dole have signed a contract - before they got the benefit - that they will accept a job at a living wage, and will not raise anti-POC objections either.
A final observation is that the power elite, those who determine the SWF, might enjoy unemployment of a section of the population for some strange other reason. They might not care about the increase of income, freedom and welfare from a change towards full employment, but they would prefer the idea of people in helpless positions and the warm gratitude they show for their benefits. A king needs subjects. We resolve this problem by proper formulation of the theorem.
Note that we use the symbols of chapter 39 (that forms a unity with this chapter).
Above theorem on the technical possibility of full employment is essentially incomplete. It has not been specified how the tax regime comes about. The tax regime is an expression of the social choice already made, but it has not been explained how a particular choice has been caused. What is required is a power distribution on the b + h + l agents in the economy. In conventional terms the power distribution is expressed as a social welfare function SWF, and the tax regime is the result of the maximisation subject to the state of information I:
maximise SWF(h, H, N, l, L, K, b, B, t; I) (40.1)
Using a SWF serves expository purposes. When turning to practical application we could use the Drissen & Van Winden (1990) approach. But the logic of both approaches is the same.
The introduction of regime indicator t as a separate variable in the SWF means that it stands as a proxy. The economy is not simply a collection of individuals maximizing utility over consumption and labour. There are some institutional aspects too. An example of an institutional influence is that some social security officials might benefit from unemployment, since it keeps them in attractive jobs. All such (Public Choice) phenomena can be collected on their point of relevance: the employment regime t.
Secondly, there is information I. Ever since Keynes and Tinbergen, or even earlier, but for some economists more acutely since Muth and Lucas, economists have given attention to the information sets that guide the activity of agents. This concerns not just plain knowledge, but rather what people believe about the state of the world. The information sets may contain individual and social aspects, like own prices and the (announced) general price level.
Variable I is an aggregate. It represents the state of knowledge of those in power, where ‘having some power’ is a state of nature given by an array or by a distribution. The latter is not further developed here. A basic point however is that if some economist would know how to solve unemployment, but those in power don’t, then the budget set is IB, while I < IB - and those in power apparently prefer not to know. [117]
The use of variable I could complicate the analysis in various ways. R&D could be an economic activity affecting social welfare itself, amending (40.1) etcetera. But the present formulation suffices for our purposes. Note, the maximisation process itself finds its operational implementation in the actual work of some agents in the economy. Such work might be implicit and thus not explicitly remunerated. More conventionally there are some administrators (e.g. a “Council of Economic Advisers“) who are explicitly paid for their information handling activities (often: whatever outcome on t).
Piore (1987) reminds us that unemployment is not a natural disaster like an earthquake, but derives its cause, nature and significance from the social system as a whole. In this line, when unemployment arises, we would find the solution by studying the whole system. This includes information. And Piore’s reminder, being a reminder, is a piece of information. Indeed, one important social type of information concerns theory itself, and economic models in particular. The development of the theory of Rational Expectations (or model-consistency) implies this too. Economic theories about unemployment are themselves part of the information sets in society. An adequate description of unemployment not only requires a statement of taxes, social security and e.g. legal minimum wage, and their technical interaction, but also a statement of people’s perceptions, of the theories in the journals, and of what journalists and politicians make of these.
When unemployment arises, it may be caused by the power distribution, but the cause can also be plain lack of knowlegde. It may very well be that Piore’s proposition has not gotten sufficient attention from policy makers and advisers. And this lack of attention, if it were true, would be a prime example of the influence of the information set on economic activity.
There are two relevant states of information: I = 1 meaning that those in power perceive of a (sound, compact) solution of unemployment, and I = 0 meaning that this is not the case. Note that knowledge about the theorem on co-ordination, that is to be formulated next, might but need not be included itself in I = 1.
The Dissipation of Knowledge
I by science, education and media need not be
detriment to those in power, but it might be. In the latter case
I would not be POC in
the ordinary sense. However, many would hold that
I morally dominates POC - and if these
people are in power, then this conviction is reflected in the SWF. Note also that
I need not
be positive, e.g. when a wise king dies or a wise government party loses the
elections. Note that when
I
coincides with a shift in power, the prime cause can be both personal
properties involved or the information; but here everything is aggregated into
the latter.
We conclude this section by a short abstract discussion of the concept and properties of information, and Lemma III.
Regard a controlable dichotomous system with states s = 0 or s = 1. Two consecutive states are of the form {0, 0} and {1, 1} where the regimes are maintained, and {0, 1} and {1, 0} where there is a switch. If policy is conscious, then the movement from one state to the other (or the same) depends on information - and thus there are four lists of basic information. With 4 such items, an agent’ mind can possess any combination. There are 15 of such combinations: namely 1 case where all 4 are known, 4 cases of only 3 items, 6 of 2 items, and 4 cases when only 1 is known. It will be useful to compress this abundance.
The following definitions are useful:
Definition: Basic information is a list of “what one does” to have one state in one moment and another state in the next moment. An example list is: {“Provide oxygen and a dry place”, “Light the match”, “Let it burn till it is all cinders.”}. Other examples are recipes, film scripts, computer programs (“Click on a button”). We can denote basic information as BI(s1, s2). Note: In this version of the proof we allow basic information to be true or false.
Definition: A state s is said to be controlable iff there exists - in principle - true basic information on both s and 1-s, and the agents have the resources to use this information. Note that this information need not be known by the agents (need not be available), and it need not even be known to the agents that the matter is not unknowable.
Definition: Information is available when at least one agent in the economy has it. (This is stronger than the ‘existence in principle’ of controlability.)
Definition: Sound
information J(s) is a list of both what one does to maintain s
and what one might do to change s into 1 - s, using true
cause and effect relations. Thus J(s) = BI(s, s)
BI(s, 1-s)
| truth. Denote an arbritrary belief as J’(s) - that however will
not be sound since it would not be necessarily true.
Remark: True information is sound when the information concerning {1, 1} and {1, 0} is joined, or if the information on {0, 0} and {0, 1} is joined. One may e.g. know how to burn or not to burn a match, but not how to restore cinders into a match again (except for restarting the universe, but that is not likely controlable). Let 1 stand for match, and 0 for cinders. Then J(1) exists, but J(0) doesn’t (only partly, to maintain cinders as they are). Using sound information rather than basic information has analytical advantage. A Roman emperor may think that he maintains his good fortune by sacrificing to the gods. We rather discuss cases where governments deliberately abstain from wrong policies.
Remark: Consider the list {“If you happen to flip
back to 0, use BI(0, 1) to go back to 1”}. Can we classify this as BI(1,
1) ? We could allow this if the cost of the temporary flip is low. For example,
riding a bicycle requires continual readjustment of equilibrium. We can define BI(s,
s) = {chance(s, 1-s)}
BI(1-s, s) | truth, as
implied control information. But since this does not give BI(s,1-s),
the implied control information does not give sound information. Stated
differently, we are interested in durable states s, and not in flipping
states. If we observe s then we want this to be caused by deliberate
rejection of the use of BI(s, 1-s). We also regard cases
in which implied control would be costly.
Definition: The tuple (J(1), J(0), s) is the state of a sound system. Note: Though the information is denoted as a function of s, information in a controlable state is the prime cause and s the prime effect.
Definition: Information
is called compact iff J(0)
J(1). Note: Compactness means that
one knows the explanation of one state, iff one knows the explanation for the
other state. Then we can use a single variable J or J’.
Definition: A state s is said to be caused by chance iff a situation of s and unsound belief J’(s) are stable. It is said then that there is a hidden cause linking J(s) to s.
Definition: If
the sound information concerns a model then we can denote J in binary
values, with 1 = ‘the model is known’ and 0 = ‘the model isn’t known’, rather
than use the whole list of statements. With binary information, compactness J(0)
J(1) becomes
J(0) = J(1).
Remark: Consider the example of the Roman emperor.
His model is ‘sacrifice
fortune’ (and if fortune slips after a sacrifice, then
apparently more sacrifices are required). One of his basic informations is BI(~fortune,
fortune) = {‘sacrifice
fortune’, ‘In this case sacrifice’}. Since J’(1)
J’(0)
this is a compact belief.
Remark: If s is the case, and one doesn’t believe J(s), so that J(s) = 0, then one believes some alternative J’(s). Someone unfamiliar with matches would have the unsound (perhaps only basic) information ‘this is just a piece of wood’. More complex situations need thorough analysis. E.g. someone may know the text of a theorem and benefit from that, but may not know its proof.
Lemma III:
If there is sound information (J(1), J(0)) on a
controlable dichotomous state s, then:
(i) if the information is not compact then there are 8 states of the system,
with 4 states implying a hidden cause,
(ii) if the information is compact, these numbers are halved.
Proof:
We tabulate the possible states of the system (J(1), J(0), s) in Table 16.
In cases (rows) (3), (4), (6) and (7), the agent doesn’t possess sound information and believes some J(s) (e.g. ‘the world is as it is’), but he chances at s nevertheless. This implies that there is a hidden cause. (For example, the state of the system was inherited, and the agent wishes to keep things as they are. In that case (J’(1), J’(0), s) has causality within a more complex model, describing in more detail how people act on their beliefs.)
If the information is compact, we only consider states (1) to (4). Q.E.D.
Discussion: To understand the proof, look for example
at row 6: There is a true model for sequential states {1, 1} and {1, 0}, or to
maintain 1 or change to 0. But nothing is truly known about maintaining 0 or
changing back from 0 to 1 (though beliefs can exist). Observed is s = 0.
Perhaps it once was a conscious choice to go from 1 to 0, and perhaps one uses
the implied control {chance(0, 1)}
BI(1, 0) | truth. But we are concerned with durable
cases for which implied control would be costly. We want to see deliberate
rejection of the use of BI(0, 1). But this information is not present.
Hence the endurance of 0 is caused by chance.
Table 16: States of the system
|
|
J |
J(1) |
J(0) |
s |
meaning |
||
|
(1) |
|||||||