David W.K. Yeung1,2 and Leon A. Petrosyan3
1 Department of Business Administration, Hong Kong Shue Yan University, Hong Kong 2 Center of Game Theory, St Petersburg State University, St Petersburg, Russia E-mail: dwkyeung@hksyu. edu 3 Faculty of Applied Mathematics-Control Processes, Saint Petersburg State University, St Petersburg, Russia E-mail: [email protected]
Abstract The provision of public goods constitutes a classic case of market failure which calls for cooperative optimization. However, cooperation cannot be sustainable unless there is guarantee that the agreed-upon optimality principle can be maintained throughout the planning duration. This paper derives subgame consistent cooperative solutions for public goods provision by asymmetric agents with transferable payoffs in a stochastic discrete-time dynamic game framework. This is the first time that dynamic cooperative game in public goods provision is analysed.
Keywords: Public goods, stochastic dynamic games, dynamic cooperation, subgame consistency.
1. Introduction
Public goods, which are non-rival and non-excludable in consumption, are not uncommon in today’s economy. Examples of public goods include clean environment, national security, scientific knowledge, accessible public capital, technical know-how and public information. The non-exclusiveness and positive externalities of public goods constitutes major factors for market failure in their provision. In many contexts, the provision and use of public goods are carried out in an intertemporal discrete time-period framework under uncertainty. Cooperation suggests the possibility of socially optimal solutions in public goods provision problem. A discrete-time game framework is developed for theoretical analysis and practical applications. Problems concerning private provision of public goods are studied in Bergstrom (1986). Static analysis on provision of public goods are found in Chamberlin (1974), McGuire (1974) and Gradstein and Nitzan (1989). In many contexts, the provision and use of public goods are carried out in an intertemporal framework. Fershtman and Nitzan (1991) and Wirl (1996) considered differential games of public goods provision with symmetric agents. Wang and Ewald (2010) introduced stochastic elements into these games. Dockner et al. (2000) presented a game model with two asymmetric agents in which knowledge is a public good. These studies on dynamic game analysis focus on the noncooperative equilibria and the collusive solution that maximizes the joint payoffs of all agents.
In dynamic cooperation, the solution scheme would offer a long-term solution only if there is guarantee that participants will always be better off throughout the entire cooperation duration and the agreed-upon optimality principle be
* This research was supported by the HKSYU Research Grant.
maintained from the beginning to the end. To enable a cooperation scheme to be sustainable throughout the agreement period, a stringent condition is needed - that of subgame consistency. This condition requires that the optimality principle agreed upon at the outset must remain effective in any subgame starting at a later starting time with a realizable state brought about by prior optimal behaviour. Hence the players do not possess incentives to deviate from the cooperative scheme throughout the cooperative duration. The notion of subgame consistency in stochastic cooperative differential games was originated in Yeung and Petrosyan (2004) in which a generalized theorem for the derivation of an analytically tractable "payoff distribution procedure" (PDP) leading to subgame-consistent solutions has been developed. A discrete time version of the analysis is provided in Yeung and Petrosyan (2010). Yeung and Petrosyan (2013) presented subgame consistent cooperative solutions for public goods provision by asymmetric agents with transferable payoffs in a continuous-time stochastic differential game framework.
In this paper, an analytical framework entailing the essential features of public goods provision in a discrete-time stochastic dynamic paradigm is set up. The noncooperative game outcome is characterized and dynamic cooperation is considered. Group optimal strategies are derived and subgame consistent solutions are characterized. A ?payoff distribution procedure? leading to subgame-consistent solutions is derived. Illustrative examples are presented to demonstrate the derivation of subgame consistent solution for public goods provision game.
The paper is organized as follows. Section 2 provides the analytical framework and the non-cooperative outcome of public goods provision in a discrete-time stochastic dynamic framework. Details of a subgame consistent cooperative scheme are presented in Section 3. Illustrative examples are given in Section 4. Section 6 concludes the paper.
2. Analytical Framework and Non-cooperative Outcome
Consider the case of the provision of a public good in which a group of n agents carry out a project by making continuous contributions of some inputs or investments to build up a productive stock of a public good. The game horizon consists of T stages. We use Kt denote the level of the productive stock and It® denote the contribution to the public capital or investment by agent i at stage t € {1,2, • • • , T}. The stock accumulation dynamics is then
n
Kt+1 = E Ij - SKt + #t,K 1 = K0, for t € {1,2, ••• ,T}, (2.1)
j=i
where $t is a sequence of statistically independent random variables and S is the depreciation rate.
The payoff of agent i at stage t is
Ri(Kt) - Ci(li),i €{1, 2, ••• ,n} = N, (2.2)
where Rl(Kt) is the revenue/payoff to agent i, Cl(ID is the cost of investing It € X \
The objective of agent i € N is to maximize its expected net revenue over the planning horizon, that is
T
,»T{ ]T[Ri(Ks) - Ci(Isi)](1 + r)-(s-1) + qi(KT+i)(1 + r)-T } (2.3)
S= 1
subject to the stock accumulation dynamics (2.1),
where r is the discount rate, and qi(KT +1) > 0 is an amount conditional on the productive stock that agent i would received at stage T.
Acting for individual interests, the agents are involved in a stochastic dynamic game. In such a framework, a feedback Nash equilibrium has to be sought. Let {$S(K)€ IS, for i € N and s € {1, 2, • • • , T}} denote a set of feedback strategies that brings about a feedback Nash equilibrium of the game (2.1) and (2.3). Invoking the standard techniques for solving stochastic dynamic games, a feedback solution to the problem (2.1) and (2.3) can characterized by the following set of discretetime Hamilton-Jacobi-Bellman equations (see Basar and Olsder 1995; Yeung and Petrosyan 2012):
V*(t,K) = max{ [R(K) - Ci(It)](1+ r)-(t-1)
n
+V* [ t + 1, Y, $ (K)+ It - SK + 0t ] }, for t €{1, 2, ••• ,T}, (2.4)
j = 1
j = i
V*(T + 1,K) = q4(KT+1)(1 + r)-T, for i € N. (2.5)
A Nash equilibrium non-cooperative outcome of public goods provision by the n agents is characterized by the solution of the system of equations (2.4) - (2.5).
3. Subgame Consistent Cooperative Scheme
It is well-known problem that noncooperative provision of goods with externalities, in general, would lead to dynamic inefficiency. Cooperative games suggest the possibility of socially optimal and group efficient solutions to decision problems involving strategic action. Now consider the case when the agents agree to cooperate and extract gains from cooperation. In particular, they act cooperatively and agree to distribute the joint payoff among themselves according to an optimality principle. If any agent disagrees and deviates from the cooperation scheme, all agents will revert to the noncooperative framework to counteract the free-rider problem in public goods provision. In particular, free-riding would lead to a lower future payoff due to the loss of cooperative gains. Thus a credible threat is in place. In particular, group optimality, individual rationality and subgame consistency are three crucial properties that sustainable cooperative scheme has to satisfy.
3.1. Pareto Optimal Provision and Individual Rationality
To fulfill group optimality the agents would seek to maximize their expected joint payoff. To maximize their expected joint payoff the agents have to solve the stochastic dynamic programming problem
{nT
, max ,*t { E E[Rj(Ks) - Ci(Ij)](1 + r)-(s-1)
{Is for jeN} •=1 S = 1
n
+ Eqj(Kt+1)(1 + r)-T } j=1
(3.1)
subject to the stock dynamics (2.1).
Invoking the standard stochastic dynamic programming technique an optimal solution to the stochastic control problem (2.1) and (3.1) can characterized by the following set of equations (see Basar and Olsder (1995) and Yeung and petrosyan (2012)):
n
W (t,K) = .max E#t {]T [Rj (K) - C*(Ij )](1+ r)-(t-1)
№ for jEN,} ...
+W[t +1,^2 Ij - JK + ] }, for t e{1, 2, ••• ,T}, (3.2)
j=1
n
W (T +1, K) = E qj (Kt +i)(1 + r)-T. (3.3)
j=1
Let V£(K) ={^1*(K ),^2*(K), ••• ,^n*(K)}, for s e {1, 2, ••• ,T} denote a set of strategies that brings about an optimal cooperative solution. A group optimal solution of public goods provision by the n agents is characterized by the solution of the equation (3.2)-(3.3).
The optimal cooperative path can be derived as:
n
Kt+1 = E ^f(Kt) - JKt + 0t, K1 = K0, for t e {1, 2, ••• , T}, (3.4)
j=1
We use X* to denote the set of realizable values of Ks generated by (3.4) at stage s and use K* e X* to denote an element in the optimal set.
Let £(•, •) denote the agreed-upon imputation vector guiding the distribution of the total cooperative payoff under the agreed-upon optimality principle along the cooperative trajectory { K* }J=1. At stage s and if the productive stock is K*, the imputation vector according to £(•, •) is
£(s, K*) = [£1(s, K*), £2(s, K*), ••• ,£n(s,K;)], for s e {1, 2, ••• ,T}. (3.5)
A variety of examples of imputations £(s, K*) can be found in Yeung and Petrosyan (2006 and 2012). For individual rationality to be maintained throughout all stages, it is required that:
f (s, K*) > Vi(s,K*), for i e N and s e {1, 2, ••• ,T}.
To satisfy group optimality, the imputation vector has to satisfy
n
W(s, K*) = E £'(s, K*), for s e {1, 2, ••• , T}.
j=1
3.2. Subgame Consistent Solutions and Payoff Distribution Procedure
Under a subgame consistent situation, an extension of the solution policy to a subgame starting at a later stage with a state brought about by previous optimal behaviour would remain optimal. For subgame consistency to be satisfied, the imputation £(-, •) according to the original agreed-upon optimality principle in (3.5) has to be maintained along the cooperative trajectory { K* }J=1.
Following the analysis of Yeung and Petrosyan (2010 and 2012), we formulate a Payoff Distribution Procedure so that the agreed-upon imputations (3.5) can be realized.
Let B\(K*) denote the payment that agent i will received at stage k under the cooperative agreement if K* is realized at stage k e {1,2, • • • , T}.
The payment scheme involving Bk (K*) constitutes a PDP in the sense that if K* is realized at stage kthe imputation to agent i over the stages from k to T can be expressed as:
for i e N and k e k.
Theorem 3.1. Given that the public capital stock is K* in stage k a payment equalling
Z=k+1
for i e N and k e k.
Using (3.6) one can obtain
Z = k + 2
Upon substituting (3.7) into (3.6) yields
n
+E*fc ( f[k + 1, (K*) - JK* + ^k] ),
(3.8)
Bk(K*) = (1+ r)k-1{ f(K,xk)
n
EeU [k + 1,£Vj'*(K*) - JK* + tfkU },
(3.9)
for i e N, be paid to agent i at stage k e{1, 2, ••• , T} would lead to the realization of the imputation /£(k, K*), for k e {1, 2, • • • , T}}.
Proof. From (3.8), one can readily obtain (3.9). Theorem 4.1 can also be verified alternatively by showing that from (3.6)
f(k,K* ) = Bk (K)
1 + r
1 ( 1 \Z
+Eek+1,ek+2,..,eA E BjTO — +q\KT+1)(l+r)-T }
Z=k+1 V + /
n
= { f(k,K*) - ESk ( f[k +1,5>f (K*) - SK* + 0fc] ) }
j=1
T n
+ E Eefc+1,efc+2i... ,et{ f(C,K*) - Ee<( f[C + 1,E^' *(K*) - SK* + ^] ) }
C=fc+1 j=i
= f(k,K*);
given that f(T + 1, KT+1) =q4(KT+1)(1 + r)-T. □
Note that the payoff distribution procedure in Theorem 3.1 would give rise to the agreed-upon imputation in (3.5) and therefore subgame consistency is satisfied.
When all agents are using the cooperative strategies, the payoff that agent i will directly receive at stage s is
Ri(K*) - (K*)].
However, according to the agreed upon imputation, agent i is supposed to receive Bi (K*). Therefore a transfer payment (which could be positive or negative)
^(s,k; ) = Bi (k* ) - |Ri(K;) - (k* )]} (3.10)
will be allotted to agent i € N at stage s to yield the cooperative imputation
f(k,K*).
4. An Illustration
In this section, we provide an illustration with an application in the build-up of public capital by multiple asymmetric agents which is a discrete time counter-part of example in Yeung and Petrosyan (2013). Consider an economic region with n asymmetric agents. These agents receive benefits from an existing public capital stock K(s). The accumulation dynamics of the public capital stock is governed by
n
Kt+1 =E Ij - *Kt + 0t,K 1 = K0, for t €{1, 2, ••• ,T}, (4.1)
j=1
where S is the depreciation rate of the public capital, /t® is the investment made by the ith agent in the public capital in stage t, and $t is an independent random variable with non-negative range {$1,'$?, • • • , $“‘} and corresponding probabilities {A1, A?, ••• , A“‘}. Moreover £“= 1 A^h = $t > 0.
Each agent gains from the existing level of public capital and the ith agent seeks to maximize its expected stream of monetary gains:
T
E*,*,.. ,0T{ Eta'K - ci(li)2](1 + + (qiKt+1 + )(1 + r)-T }, (4.2)
S =1
subject to (4.1);
where a', c', and q2 are positive constants.
In particular, a' gives the gain that agent i derives from the public capital, c'(1i(s))2 is the cost of investing I] in the public capital, and (q1 KT+1 + qi) is the terminal valuation of the public capital at stage T +1. The noncooperative market outcome of the industry will be explored in the next subsection.
4.1. Noncooperative Market Outcome
Invoking the analysis in (2.1)-(2.5) in section 2 we obtain the corresponding Hamilton-Jacobi-Bellman equations
V'(t,K) = maxE*t { [a'K - ci(/ti)2](1 + r)-(t-1)
+V'[ t + 1, E # (K)+ It - ] }, for t e{1, 2, ••• ,T}, (4.3)
j = 1
j = i
V'(T + 1,K) = (qiKt +1 + qi)(1 + r)-T,for i e N. (4.4)
Performing the maximization operator in (4.3) yields:
Wt 1
4(K) = EA"^t+1 [t+l,Y^4(K)-SK + ^ } + * G N. (4.5)
h=1 j=1
To solve the game (4.1)-(4.2) we first obtain the value functions as follows. Proposition 4.1. The value function of agent i can be obtained as:
V'(t, K) = (AjK + C£)(1 + r)-(t-1), (4.6)
for t e {1, 2, • • • , T +1} and i e N; where AT +1= q1 and CT +1= q2,
A\= (a* - Aj+1(5) and C}= -*■ —h4j+1 ^ £"=i ^+Q+i;
for t e {1, 2, • • • , T}.
Proof. See Appendix A. □
Using Proposition 4.1 and (4.5) the game equilibrium strategies can be obtained to characterize the market equilibrium. The asymmetry of agents brings about different payoffs and investment levels in public capital investments.
4.2. Cooperative Provision of Public Capital
Now we consider the case when the agents agree to act cooperatively and seek higher gains. They agree to maximize their expected joint gain and distribute the cooperative gain proportional to their non-cooperative expected gains. To maximize their expected joint gains the agents maximize
T
V2' i
»i,*2,.. ,*T { EEak - c*(/j)2](i+r)-( s-1) 2=1s=1
n
+ E(ql KT +1 + q2 )(1+ r) T }> (4.7)
2=1
subject to dynamics (4.1).
Following the analysis in (3.2)-(3.3) in Section 3, the corresponding stochastic dynamic programming equation can be obtained as:
n
W(t,K) = max {V[a2K - c2(/')2](1+ r)-(t-1)
{Ij for jew} 2=
n
+W[t +1^/1 - JK + ] }, for t G{1, 2, ••• ,T}, (4.8)
1=1
n
W(T + 1, K) = E(<2Kt+1 + q*)(1 + r)-T. (4.9)
2=1
Performing the maximization operator in (4.8) yields:
4(K) = EA^^t+i [t+l,Y,4{K)-SK+^ } I e N. (4.10)
h=1 C 2=1
Proposition 4.2. The value function W(t, K) can be obtained as
W (t, K) = (AtK + Ct)(1 + r)-(t-1), (4.11)
for t G{1, 2, ••• ,T + 1}; 2
where at +1= En=1 q2 and CT +1= En=1 q2,
E"=i _ At+1(5an<i Ct =E"=i 4c/^—^t+i^t’+^t+i'
for t G {1, 2, ••• ,T}.
Proof. Follow the proof of Proposition 4.1. □
Using (4.10) and Proposition 4.2 the optimal investment strategy of public capital stock can be obtained as:
A
ipi(K) = —^,for i G N andt G {1,2, ••• ,T}. (4.12)
2 c®
Using (4.1) and (4.12) the optimal trajectory of public capital stock can be expressed as:
n A
Kt+i=[ J2^r~SKt ] +0t,K1=K°,foit<={l,2,---,T}, (4.13)
j=i
We use X* to denote the set of realizable values of Ks generated by (4.13) at stage s. The term K* G X* is used to denote and element in X*.
4.3. Subgame Consistent Payoff Distribution
Next, we will derive the payoff distribution procedure that leads to a subgame consistent solution. With the agents agreeing to distribute their gains proportional to their non-cooperative gains, the imputation vector becomes
V£(s K*)
f(s,iC) = ,w(s,kz)
EJ=i v j (s,K*) s'
Ai K* + Ci
=-------s s . (A,K*+C,)(l + r)-<-s-1), (4.14)
e;=m°k*s+c°)
for i G N and s G {1, 2, • • • , T} if the public capital stock is K* G X*.
To guarantee dynamical stability in a dynamic cooperation scheme, the solution has to satisfy the property of subgame consistency which requires the satisfaction of (4.14) at all stages s G {1, 2, • • • ,T}. Invoking Theorem 3.1 we can obtain: Proposition 4.3. A PDP which would lead to the realization of the imputation £(s,K*) in (4.14) includes a terminal payment (q1 K^+1 + ) to agent i G N at
stage T +1 and an payment at stage s G {1, 2, • • • , T}:
Ai K* + Ci
Bi(K*) =---------s s s . (AK* + CM
s s Y"=MlK*s + ciy s s s>
-£ *
h=1
h As + l^s+l(^s ) + fii+1 r| in +rrX for i G N
■ E?.,[4+1k;+1«) + cU »+ c-+J'1 + >■) •!"'e
(4.15) □
where K*+i(^h)= Yjj
A,
;j=l^-SK*s ]+^‘
Finally, when all agents are using the cooperative strategies, the payoff that
agent i will directly receive at stage s is
7 ir* (As+i)2 aJK„-------------—.
s 4cJ
However, according to the agreed upon imputation, agent i is to receive B£ (K*) in Proposition 4.3. Therefore a transfer payment (which can be positive or negative) equalling
^£(s,k; ) = b£ (K) -
k* -
OV1)2 4 a
(4.16)
will be imputed to agent i G N at stage s G {1, 2,
5. Concluding Remarks
This paper presented subgame consistent cooperative solutions for stochastic discretetime dynamic games in public goods provision. The solution scheme guarantees that the agreed-upon optimality principle can be maintained in any subgame and provides the basis for sustainable cooperation. A "payoff distribution procedure" (PDP) leading to subgame-consistent solutions is developed. Illustrative examples are presented to demonstrate the derivation of subgame consistent solution for public goods provision game. This is the first time that subgame consistent cooperative provision of public goods is analysed in discrete time. Various further research and applications, especially in the field of operations research, are expected.
Appendix A. Proof of Proposition 4.1.
Using the value functions in Proposition 4.1 the optimal strategies in (4.5) becomes:
Ai
4(K) = for i G N and t G {1,2, • • • , T}. (A.l)
2ci
Using (A.1) the Hamilton-Jacobi-Bellman equations (4.4)-(4.5) reduces to:
(Ai )2
AiK + ci = rfK - + ]Ta^
h=1
for i G N and t G {1,2, • • • , T},
AT+1K + Ci+1 = qiK + qi, for i G N. (A.3)
For (A.3) to hold it requires
AT +1 = qi and CT +1 = q2 . (A.4)
Re-arranging terms in (A.2) yields:
n A
[ + < ) + C7m ]-
j=1
(A.2)
(Ai )2 n AJ
A\K + Ct = (a‘ - A\+1S)K - + 4+i ( E + ^ ) + Q+i, (A.5)
for i G N and t G {1,2, • • • , T}. For (A.5) to hold it requires
j=1
(Ai )2 n Aj
A\ = (a‘ - A\+1S) and C\ = ~~^L + A\+1 ( E + #t )+ Q+i- (A.6)
j=1
Note that At and Ci depend on the model parameters and the succeeding values of Ai+1 andCti+1. Using (A.4) all At and Ci, for i G N and t G {1, 2, • • • , T}, are explicitly obtained.
Hence Proposition 4.1 follows. Q.E.D.
References
Basar, T., Olsder, G. J. (1995). Dynamic noncooperative game theory. Academic Press, London/New York.
Bergstrom, T., Blume, C., Varian, H. (1986). On the private provision of public goods. J Public Econ, 29, 25-49.
Chamberlin, J. (1974). Provision of collective goods as a function of group size. Am Polit Sci Rev, 65, 707-716.
Dockner, E., Jorgensen, S., Long, N. V., Sorger, G. (2000). Differential games in economics and management science. Cambridge University Press, Cambridge
Fershtman, C., Nitzan, S. (1991). Dynamic voluntary provision of public goods. Eur Econ Rev, 35, 1057-1067.
Gradstein, M., Nitzan, S. (1989). Binary participation and incremental provision of public goods. Soc Sci Choice Welf, 7, 171-192.
McGuire, M. (1974). Group size, group homogeneity, and the aggregate provision of a pure public good under Cournot behavior. Public Choice, 18, 107-126.
Wang, W-K, Ewald, C-O (2010). Dynamic voluntary provision of public goods with uncertainty: a stochastic differential game model. Decisions Econ Finan, 3, 97-116.
Wirl, F. (1996). Dynamic voluntary provision of public goods: extension to nonlinear strategies. Eur J Polit Econ, 12, 555-560.
Yeung, D.W. K., Petrosyan, L. A. (2004). Subgame consistent cooperative solution in stochastic differential games. J Optim Theory Appl, 120(2.3), 651-666.
Yeung, D.W. K., Petrosyan, L. A. (2006). Cooperative stochastic differential games. Springer-Verlag, New York
Yeung, D.W. K., Petrosyan, L. A. (2010) Subgame Consistent Solutions for cooperative Stochastic Dynamic Games. Journal of Optimization Theory and Applications 145(3): 579-596.
Yeung, D. W. K., Petrosyan, L. A. (2012). Subgame consistent economic optimization: an advanced cooperative dynamic game analysis. Boston, Birkhauser
Yeung, D.W. K., Petrosyan, L. A. (2013). Subgame Consistent Cooperative Provision of Public Goods. Dynamic Games and Applications, forthcoming in 2013. DOI: 10.1007/s13235-012-0062-7.