Subgame Consistent Solution for a Cooperative Differential Game of Climate Change Control
David W. K. Yeung1 and Leon A. Petrosyan2
1 SRS Consortium for Advanced Study in Cooperative Dynamic Games,
Hong Kong Shue Yan University, Braemar Hill, Hong Kong Center of Game Theory, Faculty of Applied Mathematics-Control Processes
2 St Petersburg University, Petrodivorets, St Petersburg, 198904, Russia,
Abstract After several decades of rapid technological advancement and economic growth, alarming levels of pollutions and environmental degradation are emerging all over the world. Moreover, it is now apparent that human activities are perturbing the climate system at the global scale leading to disturbances to complex ecological processes. In this paper, we present a cooperative differential game of climate change control. Climate change is incorporated as structural changes in the pollution dynamics and the payoff functions. The policy instruments of the game include taxes, abatement efforts and production technologies choices. Under cooperation, nations will make use of these instruments to maximize their joint payoff and distribute the payoff according to an agreed upon optimality principle. To ensure that the cooperative solution is dynamically consistent, this optimality principle has to be maintained throughout the period of cooperation. An analytically tractable payment distribution mechanism leading to the realization of the agreed upon imputation is formulated. This analysis widens the application of cooperative differential game theory to environmental problems with climate change. This is also the first time differential games with random changes in the structure of their state dynamics.
Keywords: Cooperative differential games, subgame consistency, climate change, environmental management.
1. Introduction
After several decades of rapid technological advancement and economic growth, alarming levels of pollutions and environmental degradation are emerging all over the world. Moreover, it is now apparent that human activities are perturbing the climate system at the global scale leading to disturbances to complex ecological processes. Climate change is typically structural change that affects the regeneration capacity of the natural environment. Even draconian measures (like a virtual phase-out of fossil fuel) would only slow or stop and not reverse climate change. Reports are portraying the situation as an industrial civilization on the verge of suicide, destroying its environmental conditions of existence with people being held as prisoners on a runaway catastrophe-bound train. Due to the geographical diffusion of pollutants and the global nature of climate change, unilateral response on the part of one country or region is often ineffective. Though cooperation in environmental control holds out the best promise of effective action, limited success has been observed. Existing multinational joint initiatives like the Kyoto Protocol can hardly be expected to offer a long-term solution because (i) the plans are limited only to emissions reduction which is unlikely be able to offer an effective mean to
halt the accelerating trend of environmental deterioration brought about by climate change, and (ii) there is no guarantee that participants will always be better off and hence be committed within the entire duration of the agreement.
Differential games provide an effective tool to study pollution control problems and to analyze the interactions between the participants' strategic behaviors and dynamic evolution of pollution. Applications of noncooperative differential games in environmental studies can be found in Yeung (1992), Dockner and Long (1993), Tahvonen (1994), Stimming (1999), Feenstra et al (2001) and Dockner and Leitmann (2001). Cooperative differential games in environmental control are presented by Dockner and Long (1993), J0rgensen and Zaccour (2001), Fredj et al (2004), Breton et al (2005 and 2006), Petrosyan and Zaccour (2003), Yeung (2007) and Yeung and Petrosyan (2008).
In dynamic cooperative games, a credible cooperative agreement has to be dynamically consistent. For dynamic consistency to hold in deterministic games, a stringent condition on the cooperative agreement is required: The specific optimal-ity principle must remain in effect at any instant of time throughout the game along the optimal state trajectory chosen at the outset. This condition is commonly known as time consistency. In the presence of stochastic elements, a more stringent condition - subgame consistency - is required for a dynamically consistent cooperative solution. A cooperative solution is subgame consistent if an extension of the solution policy to a situation with a later starting time and any feasible state brought about by prior optimal behaviors would remain optimal. Cooperative differential games that have identified dynamically consistent solutions can be found in J0rgensen and Zaccour (2001), Petrosyan and Zaccour (2003), Yeung and Petrosyan (2006a), Yeung (2007), and Yeung and Petrosyan (2004, 2005, 2006b and 2008)).
In this paper, we present a cooperative differential game of climate change control. Climate change is incorporated as structural changes in the pollution dynamics and the payoff functions. Since uncertainties in climate change have been observed (see Berliner (2003) and Allen et al. (2000)), a stochastic formulation of the changes is adopted. The policy instruments available include taxes, abatement efforts and production technologies choices. Under cooperation, nations will make use of these instruments to maximize their joint payoff and distribute the payoff according to an agreed upon optimality principle. To ensure that the cooperative solution is dynamically consistent, this optimality principle has to be maintained throughout the period of cooperation. Crucial to the analysis is the formulation of a payment distribution mechanism so that the agreed upon imputation will be realized. We follow Yeung and Petrosyan (2004 and 2006a) and derive an analytically tractable payment distribution mechanism ensuring the realization of dynamically consistent solutions. This analysis widens the application of cooperative differential game theory to environmental problems with climate change. This is also the first time differential games with randomly changes in the structure of their state dynamics.
The paper is organized as follows. Section 2 provides a game model with technologies choice and climate change. Noncooperative outcomes are characterized in Section 3. Cooperative arrangements, group optimal actions, solution state trajectories, and individually rational and dynamically consistent imputations are examined in Section 4. A payment distribution mechanism bringing about the proposed dynamically consistent solution is derived and scrutinized in Section 5. Section 6 ex-
amines the case where partial adoption of climate-preserving technologies appears. Concluding remarks are given in Section 7 and mathematical proofs are provided in the appendices.
2. A Game Model with Technology Choice and Climate Change
In this section we present a differential game model with technology choice and climate change. There are n asymmetric nations (or regions) and the game horizon is [to,T].
2.1. The Industrial Sector
These n asymmetric nations form an international or global economy. At time instant s the demand function of the output of nation i G N ={1, 2, ••• ,n} is
n
Pi(s) = ai "E j j(s), (2.1)
j=i
where Pi(s) is the price of the output of nation i, qj(s) is the output of nation j, ai and jj for i G N and j G N are positive constants. The output choice qj (s) G [0, q_j] is nonnegative and bounded by a maximum output constraint qj. Output price equals zero if the right-hand-side of (2.1) becomes negative. The demand system (2.1) shows that the world economy is a form of differentiated products oligopoly with substitute goods. In the case when a1 = aj and j = jj for all i G N and j G N, the industrial output is a homogeneous good. This type of model was first introduced by Dixit (1979) and later used in analyses in industrial organizations (see for example, Singh and Vives (1984)) and environmental games (see for examples, Yeung (2007) and Yeung and Petrosyan (2008)).
There are two types of technologies available to each nation's industrial sector: the existing technologies (which is not climate-preserving) and climate-preserving technologies. Industrial sectors pay more for using climate-preserving technologies. The amount of pollutants emitted by climate-preserving technologies is less than that by existing technologies. Moreover, non-climate-preserving technologies do not only emit more pollutants, they also damage the environment like destroying forests, making marine and animal species extinct directly or indirectly, breaking food-chains and desertification. These damages lead to structural climate changes. Therefore it is not pollution per se but the use of non-climate-preserving technologies that contributes to climate change. Climate conditions will be preserved only when climate-preserving technologies are used in the economy.
We use qj (s) to denote the output of nation j produced with existing technologies and ( (s) to denote the output of nation j produced with climate-preserving technologies. The cost of producing qj (s) units of output with existing technologies is ciqi(s) while that of producing ( (s) units of output with climate-preserving technologies is ciqi(s). In addition, ci >ci. In the absence of government regulation or incentive, the industrial sectors will not adopt climate-preserving technologies. In the case when all industrial sectors are using existing technologies industrial profits of nation i at time s can be expressed as:
n
n (s) = [ai "E j qj (s)]qi(s) - ^(s) - vt(s)q^(s), for i G N. (2.2) j=l
where vi(s) is the tax rate imposed by government i on industrial output produced by existing technologies at time s. At each time instant s, the industrial sector of nation i € N seeks to maximize (2.2). The first order condition for a Nash equilibrium for the n nations economy at time s yields
n
YjP, j(s) + PUi(s) = ai - q - Vi(s), for i € N. (2.3)
j=1
Equation system (2.3) is linear in q(s) = {q1(s), q2(s), • • • , qn(s)}. Taking the set of output tax rates v(s) = {v1(s),v2(s), • • • ,vn(s)} as parameters and solving (2.3) yield an industrial equilibrium which can be expressed as:
n
q,(s) = a4 + £% vj(s), (2.4)
j=l
where ai and , for i € N and j € N, are constants involving the model parameters % = • • ¡n; 02,0% ,• • • ;• • • ; • • ,%n}, a = {a1, a2, • • • ,an} and c = {c1,c2, • • • ,cn}. Proper choice of parameters leading to a valid industrial equilibrium is assumed.
In the case when all industrial sectors are using climate-preserving technologies industrial profits of nation i at time s can be expressed as:
n
^¿(s) = [a4 - ^ ¡i qj (sM(s) - ciqi(s) - vi(s)qi (s), for i € N. j=l
Once again a system of linear equations in q(s) = {q1(s), q2(s), • • • , qn(s)} is formed. The set of output tax rates v(s) = { v1 (s), v2 (s), • • • , vn(s)} can be regarded as a set of parameters. An industrial equilibrium gives:
n
qi(s) = ai + qj vj(s), for i € n. j=1
2.2. Pollution Dynamics
Industrial production emits pollutants into the environment and the amount of pollution created by different nations' outputs may be different. Each government adopts its own pollution abatement policy to reduce pollutants existing in the environment. At time t0 the climate condition in the time interval [t0,t1), for t1 <T, is known to be . Let x(s) C R+ denote the level of pollution at time s, the dynamics of pollution stock under climate condition is governed by the differential equation:
X(s) = aj° j(s) - bj0uj(s)[x(s)]1/2 - Sdo x(s), x(to) = Xt0, s £ [to,ti), j=i j=i 0
(2.5)
9
where a90 is the amount of pollution created by a unit of nation j's output Uj(s) is the level of pollution abatement activities of nation j,
bj "J
activities of nation j ,
b„0 Uj (s)X(s)]1/2 is the amount of pollution removed by Uj (s) level of abatement
5,.0 is the natural rate of decay of the pollutants, and the initial level of pollution
U0
at time t0 is given as xt0.
Since existing technologies are not climate preserving, the climate condition will deteriorate. Moreover, uncertainties in climate change have been observed (see Berliner (2003) and Allen et al (2000)). In particular, in future instant of time tk, for k €{1, 2, ■ ■ ■ , t} and t1 <t2 < ■ ■ ■ <tp < T =tp+1, the change in climate is
k[ ] k\ ]
affected by a series of random climate variables eak ]. If eak ] has occurred at time tk, it will prevail in the period [tk,tk+1). The process 0!klk], for k €{1, 2,- ■ ■ ,t}, is a random variable stemming from a randomly branching process as described below.
Given has occurred in the interval [t0,t1), the random variable 0lf'a° € {Q10,ao^ , ]} will occur with corresponding probabilities
{^i\o,ao],^i[o,a°], ..., Alj-,°'a0 1} in the period [t1,t2). Note that probabilities of the occurrence of the climate variables are affected by the level of pollution. Given that e1a[°'ao] € {e!(0'ao\e11\(0'a°, ...,e1[0'ao]} has been realized in [ti,t2) , the random variable e2c2i'ai') € {e2^1,ai) ,Q2-tyi,ai'), ..., e^^a^} will occur with corresponding probabilities {A?[(1'ai)],A^[(1,ai)], ..., A^'"1^ } in the period [t2,ts).
In general, given that e10'ao\e2a[(1'ai)\■ ■ ■ ^eka--[}i'ai)(2'a2)'''(k-2'ak-2)] has been realized, the random variable ekJk(1'ai) (2'a2)'''(k-1'ak-i) € {ek[(1'ai)(2'a2)."(k-1'ak-i)\ ek[(1'ai) (a'a2)...(k-1'ak-i)], ......, ...ttC^)]} will occur with correspond-
ing probabilities {Ap'ai)(2'a2)...(k-1'ak-i) ,Ak[(1'ai)(2'a2)...(k-1'ak-i)], ...
in the period [tk ,tk+1), for k € {1, 2, ■ ■ ■ ,p}. Finally, given that eal°'ao] 02a^'ai)] ■ ■ ■ ,eltt1'ai)(2'a2)-(p-1'ar-i) has been realized, the random variable eTJ}l'ai)(2'a2Y-(P'a^€ {eT[(1'ai)(2'a2)...(P'aP)],eiT^^mp^^ ■ ■ ■ ■ ■ ,0n (2'a2)..(p'ap))} will occur with corresponding probabilities
A[(1'ai)(2'a2)...(P'ap)] AT[(1'ai)(2'a2)...(P'ap)] _ ATK^i )(2'a2)...(p,ap)] , at t t i { 1 , 2 , , VT [(1'ai)(2'a2)...(P'ap)]} .
reversible climate change implies that there is no possibility for climate condition to revert to a better state.
If the climate condition ea<ll(1'ai)(2'a2)...(k 1,ak-i)] occurs in the time interval [tk,tk+1), the dynamics of pollution stock becomes:
n „k[(i,ai)(2,a2)...(k-i,ak-i)] n k[(i,ai) (2a2)-(k-iak-i)]
x(s) = £ ae;k j (s)-£ bp Uj (s)[x(s)]1/2
j=1 j=1 - 5 k[(i,ai)(2,a2)...(k-i,ak-i)] x(s), for s € [tk,tk+1) and k €{1, 2, ■ ■ ■ ,p}.
(2.6)
Dynamics (2.6) reflects that climate change is a structural change such that the transformation of industrial emission into pollutants, the natural rate of decay and
gk[-] gkV]
the effects of abatement activities could be affected. More specifically, a j > aje ,
ek[•] ekV] ek[-] ekV] kM
bj < bj and 5^ < 5je if eq represents a worse climate condition than that represented by .
2.3. The Governments' Objectives
The governments have to promote business interests and at the same time bear the costs brought about by pollution and climate conditions. In particular, each government maximizes the gains in the industrial sector plus tax revenue minus expenditures on pollution abatement, damages from pollution and losses from climate conditions. In the time interval [t°,ti) the instantaneous objective of government i can be expressed as:
ßjqj(s)]q(s) - ci[q(s)]2 - C1K(s)]2 - x(s) - e?, « g n, (2.7)
where c* [ui(s)]2 is the cost of carrying out ui level of pollution abatement activities,
8° 8° hi°x(s) is the value of damage to nation i from x(s) amount of pollution, and e
is cost to nation i under climate condition 0°. Note that the damage from pollution
8°
could be related to the climate condition. Moreover, the cost e8° reflects losses from floods, draughts, abnormal temperatures, storms, heat waves, cold spell and similar climate change related problems.
If the climate condition ok[(1'ai)(2'a2)"'(k 1,afc-i)] occurs in the time interval [tk, tk+i), for k G {1, 2, ■ ■ ■ , p}, the instantaneous objective of government i becomes:
k[(1,ai) (2,a2)...(k-1,ak-1)] 8 a
K ^ßjqj(s)]qi(s) - ci[qi(s)] - ctUiS]2 - h:k x(s)
j=1
k[(1,ai) (2,a2)...(k-1,ak-i)]
- ea , « G N. (2.8)
8k[8kl] 8k['] 8kl] fcf.l In particular, hi ç > hi 1 and ei ç > e if 0Ç represents a worse climate
&H
condition than that represented by 0/ .
qT [(1,^1 )(2,a2)...(p,ap)]
damage is T^^(.^-(P.ap)] Kt[(1,ao (2,a2)...(P,ap)] - x(T)] where
At time T, if 0aT 'ai 'a2>---(p'ap>' occurs, the terminal appraisal of pollution
git\(1,a1) (2,a2)...(P,a )] > 0. In particular, if the terminal level of pollution is lower
8 a^
(higher) than x1 T[(1tai) (2,a2)...(p,a )], government i will receive a bonus (penalty)
8 aT
equaling g8 t l(i,ai)(2,a2)...P,ap )] [X8t [(1 ,a i ) (2 ,a2 ) ... (p,ap )] - X(T )]. M0re0Ver, 98 T [•]> 98 T [•] 8aT 8 aT 81
and xi t • < xi t • if J represents a worse climate condition than that represented
by 0T[ ].
The discount rate is r. Each one of the n governments seeks to maximize the integral of its instantaneous objective specified in (2.7)-(2.8) over the planning horizon subject to pollution dynamics (2.5)-(2.6). By substituting
n _
qi(s) = ai + 22 $ vi(s), for i G N, from (2.4) into (2.5)-(2.8), one obtains a j=i J j
stochastic differential game in which government i G N seeks to:
T
T
max
Vi(s) ,Ui(s)
[ai - £ Kj(aj + £ j vh(s))](oi + ^ Kh vh(s))
1 vh(
j=i h=i
h=1
- Ci[«i + ^ Kh Vh(s)] - C*[Ui(s)]2 - h8°x(s) - e8
p ni
h=i V2(1,ai)
3-r(s-t°)ds
Vk[(1,ai)(2,a2)...(k-1,ak-i)] + y^ ^ ^i[0,a°] ^ ^2[i,ai] ■ ■ ■ ^ xk[(l,ai)(2,a2)...(k-i,ak-i)]
k=i ai=i a2 = i afc = i
tk+i
n n n n
/ [ai - £/(aj + £j Vh(s))](ai + £Kh Vh(s)) - Ci[ai + £Kh Vh(s)]-
^ L _ 1 I, _1 Z, _1 Z, _1
tk
ca[Ui(s)]2 - h8ak
k [(1 ,a i ) (2,a[)...(k-1,ak-i)]
x( s) - e
h=i
k [(1 ,a i ) (2,a[)...(k-1,ak-i)]
h=i
e-r(s-to) ds
ni
n2(1,ai)
nT [(1,ai)(2,a2)...(p,ap )]
+ £ Aai°'a°] £ ^
ai = i a2 = i
XT^a )(2,a[)..
■(P,aP)]
a =i
gl„ T[(1,ai)(2,a2)...(p,ap )] [xK T[(1 ,a i) (2 ,a [) ... (p,ap)] - x(T )]e ^(T ^
for i G N. (2.9)
subject to
x(s) = £ j[aj + £ /?h vh(s)] - £ j uj(s)[x(s)]i/2 - s8°°x(s),
j=i
h=i
j=i
c(t°) = Xt°, s G [t°,ti), and
x(s) = £ (j*k j=i
k [(1 ,a i ) (2,a2)...(k-1,ak-i)] 8 a
[aj + £Kh Vh(s)]
h=i
n 8kl(1,al)(2,a2)...(k-1,ak-l)]
-£ bp Uj(s)[x(s)]i/2
j=i
- ^ k[(i,a1) (2,a2)...(k—i,ak_1)] x(s), for s G [tk,tk+i) and k g{1, 2, •
,p}. (2.10)
3. Noncooperative Outcomes
In this section we discuss the solution to the noncooperative game (2.9)-(2.10).
To obtain a feedback solution for the game, we first consider the solution for the sub-
game in the last time interval, that is [tp ,T]. For the case where
0pp(i,ai) (2,a,2)...(p i,ap_i)] has occurred at time instant tp and x(tp) = xtp G X,
player i would seek to:
max
Vi(s),Ui(s)
K ^ ßj + E ßh Vh(s))](ai + E ßi vh(s))
j=l
h=l
h=l
- aa + ë ßh vh(s)] - caiui(s)]2
h=l
p[(1,ai) (2,a2)...(p-l,ap-1)] - k,ap
p[(1,ai) (2,a2)...(p-1,a -i)]
(s) -
e-r(s-tp)ds
VT [(1,ai)(2,a2)...(p,ap )]
+ E xT[^i'ai)(2'a2)-(p'ap)]ni
aT = 1
p)ln i ¡-T-i
9„T [(1 ,a i ) (2,a2)...(p,ap )] [x„T [(1 ,ai) (2 ,a 2) ...(p,ap )] "a^ "ar-n
- x(T)]e-r(T-tp) for i e N,
(3.1)
subject to
x(s) = E
n p[(1,ai)(2,a2)...(p-1,ap-i)]
[j + E j vh(s)]
j=i
j=i
p[(1,ai) (2,a2)...(p-1,ap-i)]
h=l
Uj (s)X(s)] ' - S„[(1,ai)(2,a2)...(p-1,ap-i)] x(s),
x(tp) = xt e X for s e [tp,T].
(3.2)
A feedback Nash equilibrium solution can be characterized with the techniques developed by Isaacs (1965), Bellman (1957) and Nash (1951) as:
Lemma 3.1. A set of feedback strategies [u*(t) = jJiP(gpi(1'ai)('2'a2)-(p-1'ap-1)]. t,x), v*(t) = 4>p(epalp{1'ai)(2,a2)-(p-1,aP-l)]. t,x), for i £ N and t £ [tp, T] } provides a Nash equilibrium solution to the game (3.11)-(3.12) if there exist continuously differen-tiable functions V(tp)i(Qpi(1'ai)(2'a2)-(p-1,ap-l)]. t,x):[tp,T]xR^ R, i £ N, satisfying the following partial differential equations:
( r n n
-Vt{tp)i(e^; t, x) = mUX { ^ ^ ßi(aj + j t,x) + ßjVi)]
v%,u j=1 h = 1
h = 1
h=i
X[«i + E ßh ft (°P t,x) + ßi Vi ] - Ci [j + E ßh K(K t,x) + ßivi]
h = 1
h=i
h = 1
h=i
-eau? - h"
ga\-] ga\-]
' x-Ei ap
-r(t-tp)
p
p
p
a
j
p
+V(tp)i(0Pp];t,x)' 8 "
a + E M € (0pp p]; *,x) +M ^ L j=i h = i h = i
' 8P['] 8P['] 1 1
^ E br t,x)x1/2 - bp UiX1/2 - ô8Pj,x |
j = 1
j =i
VT [(1,ai)(2,a2)...(p,ap )]
V (tp)i (0pp]; T, x) = ¿2 AaT(1'ai)(2'a2)'"( p 'ap)]g8 T [(i,ai)(2,
p)
ap
a2)'"(p,ap)]
*t = i
X[XIt [(1,ai)(2,a2)...(p,ap )] - x(T )]e r(T tp); 8 aT
for i G N, (3.3)
where 0pp[p] is the short form for op^^H2'^. p-ia—i)].
Performing the indicated maximization in (3.3) yields:
8P[•] b ap
n n \
-12 jaj + E M pp ]; t,x)i m
j=1 h EN '
nn
- E jj M l[ai + E M K (0p p ] ; t, x)l -
j=1 h El
+VitP)i(0p p ]; i,x)er(t-tp)Y^ j M = 0, (3.4)
8pH j
j=i
for t G [t p < T] and i G N.
System (3.4) forms a set of equations linear in
{$P (0pp ]; t, x), $ (0pp ]; t,x),..., <(0 P p ]; t, x)} with
{V(tp)i(0pp]; t, x)er(t-tp\ vXtp2(0Pp]; t, x)er(t-tp), ■ ■ ■ ,Vx(tp)n(0PJp]; t, x)er(t-tp)}
being taken as a set of parameters. Solving the second set of equations in (3.4) yields:
# (0p p ]; t,x) = a^, K^i V(tp)i (0pP ]; t,x)er(t-tp), i e N, (3.5) j=1
where ai p[.] and p[,] , for i e N and j e N, are constants involving the constant
8 ap 8 ap j
coefficients in (3.4). Substituting the results in (3.4) and (3.5) into (3.3) and upon solving we obtain:
a
Consistent Solution for a Cooperative Differentia,l Game Proposition 3.1. System (3.3) admits a solution
y(tp)i(epp];t,x) = [Af(epp];t)x + Cf(e P p];t)] e-r(t-tp), for i € N, (3.6)
where {A f(efp]; t),A2(efp];t), ■ ■ ■ ,Apn(efp];t)} and
{Cf (ef p ]; t),Cf (efp]; t), ■ ■ ■ ,Cp (efp]; t)} satisfy the following sets of constant differential equations:
e p [•] (b p )2
M(opp];t) = (r +6^,) Attest) -
gp H
(6Y ,, gpv\
ii (e a p ;
E H^M^ff^ + V' ; (3-7)
2ca
j = 1 j
j = i
ep (opp ]; t) = rep (epp J; t)- aj [a +£ j ahpH+£ ^ K(<]; t)]}
j=1 h=1 k=1
ai + E jh Kphi + E jhPH]k Ak(epp];t)]
gp k
h =1 k=1
+cda - £ j j + £ jjPHk Ak (epp ]; t)]} + £g
à3
gpp gpp k j=1 p k=1 p
-Ap (epp]; t)
n gph n n
!>/ p [a + E j +E ~àhepHk Ak(epp ]; t)]}
j=1 h =1 k=1 QpI• [(i,pi)(2,p2)-(p,pp)] \Tl(1,ai)(2,a2)...(p,ap)] i
; (3.8)
AP(QP[ • [(i,ai)(2,a2)...(p,ap ) ] ,1 [(1 ,ai )(2 ,a2 ) ... (f,a p )] i ,
Ai (e a p ; T )= 1t = 1 AaT 9eT [(i,ai)(2,a2)...( p ,a p )] and
eaT
VT [(i,ai)(2,a2)...( p ,ap ) ]
Cf (ef [' ]; T) = ^ \Tl{1'ai) (2'a2)...( P ,a p)]
aT=1
X^ T [(i,ai)(2,a2)...( p ,a p )] x]'eT [(i,ai) (2,a2)...( p ,ap ) ] ; eaT eaT
for i € N. (3.9)
Proof. See Appendix 1. □
Using (3.4), (3.5) and the results in Proposition 3.1, the corresponding feedback Nash equilibrium strategies of the game (3.1)-(3.2) can be obtained as:
ea[•]
ba
= and
#(0Pp[p];t,x) = ai [.]+Yi,^].Ap(OpP];t), for i G N and t G [tT,T].
8a 8a j
j=i
A remark that will be utilized in subsequent analysis is given below.
Remark 3.1. Let V p(T )i(opp] ; t, x) denote the value function of nation i in a game with payoffs (3.1) and dynamics (3.2) which starts at time t for t G [tp,T]. One can readily verify that V(tp)i(0p[p]; t, x)=er(T-tp)Vp(t)i(0p[p]; t, x), for t G [tp, T].
Lemma 3.1 characterizes the players' value function V(tp)i(0p<[p]; t,x) during the time interval [t p,T] in the case where 0p[p] G{0p[ '],0p[ '],■ ■ ■ ,0plp]} has occurred. In order to formulate the subgame in the second last time interval [t p-i,tp), it is necessary to identify the terminal payoffs at time t . To do this, first note that if op[(1'ai)(2'a2)...(p 1,ap_i)] occurs at time tp the value function of player i is V(tP)i(0 p [(i,ai) (2,a2)...(p-i,ap_i)]; tp,x) at tp. The expected terminal payoff for player i at time t can evaluated as:
Vp [(i,ai)(2,a2)...( p-i,o p-i)]
EA p [(1,ai )(2'a2)...( P-1'a p-i )] V (t p )i(0 P [(1'ai)(2'a2)...( P -1'a p-i )] . t x) ap V ap i pi >
for i e N, (3.10)
For the case where 0pp-1i[(1'ai) (2'a2)"'(p 2'ap-2)] occurs in time interval [t p-1,tp) and x(tp-1 ) = xp-1 at time tp-1, the subgame in question becomes an n—person game with duration [t p-1,tp), in which player i maximizes the expected payoff:
/tp r n n n
[ai -Y. j (aj + H Mh Vh (s))l(ai ^ Mh Vh(s))
-p-^ j=1 h=1 h=1
p-i[(i,ai) (2,0.2)...(p 2,a-2)]
-oa + e Mh «h(s)i - c>i(s)]2 - p-i x(s)
h=1
e-r(s-tp-i) ds
p-i[(ioi) (2,02)...( p-2,o p-2)] 80
0 p -i
Vp [(i,0i)(2,02)...( p-i,o p-i)]
+ V ap [(1'ai)(2'a2)...( p-1a p-i)]
a
xV(t p )i(0p [ (1'ai) (2'a2)...( p-1'a p i)] ; t p, x)e-r(t p-tp-i), for i e N, (3.11)
subject to
a„-
x(s) = E -3 j=i
n „ p-1[(1,a1)(2,a2)...( p-2,a p-2)] n
[a3 + Eß vh(s)]
h=1
n „ p-1[(1,ai)(2,a2)...( p-2,a p-2)]
' - Uj (s)[x(s)]V2
-E 3 p-1
3=1
-<^ p-1[(1,a1)(2,a2)...(p-2,a 2)] x(s), x(tT-1 )= Xtp-1 G X, für S G [tp-1,tp)
(3.12)
A feedback Nash equilibrium solution can be characterized as:
Lemma 3.2. A sett of feedback strategies {u*(t) =pp-i(0p-i1[(i'ai) (2'a2)."( p-2,a p _2)];
t,x), V*(t) = $-l(0pppai )(2,a2)...(p-2,ap^2)]; t,x), for i G N and t G [tT ,T]} provides a Nash equilibrium solution to the game (3.11)-(3.12) if there exist continuously differentiable functions V(t r—l)i(0P-—1¡(1'al )(2,a2)...( p-2,a p _2)]; t,x):[t p-i ,t p ]xR^ R, i G N, satisfying the following partial differential equations:
C r n n
-V(tp-O^p-IH. t, x) = rn^{ [ ß ^ßja + E ßh €-1(VPp-n1]; t,x)+ßiVi)]
j=1 h = 1
h = i
x[aii + E ßh ^X^^ t, x) + ßlvi] h = 1
h=i
-Ci[«i + E ßh K-1(eP--11[];t,x)+ßivi]-cak]2-hp-1 xsp-h = 1
h=i
-r(t-t p-1)
^S-N; t,x) ¿ ap- [a3 + ^ ßh ^X-1^; t, x) + ß Vi]
L 3=1
h = 1
h=i
8p„-
8p„-
-12 3-1 ^X-1'0; t,x)x1/2 - -1 Uix1/2 -
j = 1 a p -j =i
1
p
p-1
e
x
np [(i,ai)(2,a2)...( p-i,a p-i)]
y(tp-i)i(ep-1[-]-1 x) = V^ ap[(1'ai)(2'a2)...(p-1'ap-i)]
a -i a a
xy(tp)i(ep[(1,ai)(2,a2)...(p-1'ap i)]; tp, x)e-r(tp-tp-i); for i € N, (3.13)
where eP-] is the short form for ef-^1'^ (2'a2)...(p-2'ap-2)\
Following the analysis above, the value functions y(tp-i)i(ep¡-1i['];t,x) can be characterized as:
Proposition 3.2.
y(tp-i)i(epp--1lil;t,x) = [Ap-1(ep--^;t)x + cr^ep-1;t)] e-
for i € N, (3.14)
where A-1^-^; t), Af-1(eP--[1]; t), ■ ■ ■ , A- (epa--[1]; t)} and
{CP-1(epp--ll1]; t), Cf-1(ep--1[1]; t), ■ ■ ■ , Cp-1(ep--1[1]; t)} satisfy
Vp [(i,ai)(2,a2)...( p-i,a-)]
AP-1(eP_1I']; T)= AP[i1'ai)(2'a2)...(p-1,ap-i)]
a
XAP. (ep[(1'ai )(2'a2)...(p-1'ap-i)]; tp,x),
Vp [( i ,a i )(2,a2)...( p - i ,a - i )]
Cf-1(eP--•; T)= £ AP^^ i )(2'a2)...(P-1,ap-1 )]
a
xCp (ep[(1'a i )(2'a2)...( p-1'a p-)]; t p, x); (3.15)
and equations (3.7) and (3.8) with
Ap (ep[pl; t) replaced by Af-1 (ef-^; t), Cf (e^; t) by Cf-1 (ef-^; t), and ef[p]
by ef
Proof. Follow the Proof of Appendix 1. □
Following the above analysis, for the subgame in the interval [tk ,tk+1) with eak-kl'ai)(2,a2)...(k 1,ak-i)} occurs in interval, the expected terminal payoff for player i at time tk+1 can evaluated as:
Vk+ i[( i ,a i )(2,a2)...(k,ak )]
EAk+1[(1,a i )(2'a2)...(k,ak)] Aak+1
ak+1
xy(tk+i)i(ek+i[(1'ai)(2'a2)...(k'ak)];tk+1,x), for i € N. (3.16)
For the case where e^k-1'0,i)(2,a2)..(k 1,ak-j.)] occurs in time interval [tk,tk+1) and x(tk) = xk at time tk for k € {0,1, 2, ■ ■ ■ ,t — 1}, the subgame in question
becomes an n-person game with duration [tk,tk+1), in which player i maximizes the expected payoff:
tk + i
n n n n
/ [ai - £ j (aj + Vh(s))](ai + «h (s)) - c^M + «h(s)]
J L A- 1 1-._ -I 1-. _-I 1-. _-i
tk
8k[(i,ai) (2,02)...(k-i,ak-i)\ 8k[(ioi) (2,02) ... (k-i ,ak-i)
-C>i(s)]2 - hp x(s) - £i°k
'-r(s-tk ) ds
nk + i[(i,ai)(2,a2)...(k,ak )]
+ 12 Afc+1[(1'ai)(2'a2)...(k'ak)]
ak+i
xv (k+1)i (0k+1i[(1'ai )(2'a2)...(k'ak)]; tfc+1 ,x(tk+1))e-r(tk+i-tk), for i e N, (3.17)
subject to
n 8k[(i,ai)(2,a2)...(k-i,ak-i)i n
x(s) = Yl j [aj + 12 Mh Vh(s)l
j=1 h=1
n 8k[(i,ai)(2,a2)...(k-i,ak-i)]
-£ bjak Uj(s)[x(s)]1/2
j=1
-5 k[(i,ai)(2,a2)...(k-i,ak-i)\ x(s), x(tk )= xtk e X f°r s e [tk ,tk+1)- (3.18) 8ak
Following the above analysis, one can obtain Proposition 3.3.
V (tk )i (0k[(l'ai)(2'a2)...(k-1'ak-i)]- t x) =
[Ak(0k[(l'ai ) (2'a2)...(k-1'ak-i)] • t)x + Ck(0k[(1'ai) (2'a2)...(k- 1a-i )] • t)] e-r(t-tk) r ak , r ak , (3.19)
for i e N, t e [tk, tk+1] ,
/jk[(1'ai) (2'a2)...(k-1'ak-i)] r /jk[(1'ai) (2 a )... (k-1 ^k-1 )] ,,k[(1'ai ) (2^2)...(k-^ak-i )] 0ak ei01 ,02
• )]} and k g{o, 1,2, ■ ■ ■ ,p -1};
where Ak(0kk(1'ai)(2'a2)...(k-i'ak_i)]; t) and Ck(0k[(1'ai) (-2'a^-(-k-1,ak_i)];t), for i G N, satisfy the following sets of differential equations:
(i°pp )2
Àî(°l[-];î) = (r + s^AKe^-t) - Ll_L[^(0M-];î))]2
gk[H]
" (b pk )2 gk[H]
]T + (3-20)
j
j=1
j=i
ek (ekp ]; t) = rek (ekp ]; t)
/ n n n
(a - £ j [aj + £ j [àh9kaH] + £ àhkN, Ak (ekk ] ; t)]}
^ j=1 h=1 pk 1=1 gp nn
a + £ jh [àhpH + £ ~PhepkHh Akk(ekp]; t)] h = 1 pp 1=1 p
n n gkH
+Ci[j -Y j ajgpan + £ jjpPH, Ak (ekp ]; t)]} + £ipp j=1 pp £=1 pp
-Ak (ekp ]; t)
n gpH '
][>> [a + £ jh K^+Y àh^ Ak (ekp ]; t)]} L j=1 h =1 pp £=1 pp
np + 1[(1,pi)(2,p2)...(p,pp )]
Ak (Bkp ] • T )= £ ^fc+1[(1,ai )(2,a2)...(k,ap)]
Sk'eap ; T ) =
ap+i
xAk+1(ek+1(1'ai)(2'a2)---(k'ap)]; tk+1,x), np+i[(i,pi)(2,p2)...(p,pp )]
ek (ek [' ]; T )= £ ^fc+1[(1,ai)(2,a2)...(fc,ap )]
k
ea p T ) =
a p+i
xek+1(ek++1il(1'ai)(2'a2)...(fc'a p )]; tk+1, x); (3.21)
where ek ] is the short form for ekp(1'ai)(2'a2)...(k-1'ap-i) .
Proof. Follow the proofs of Propositions 3.1 and 3.2. □
The corresponding feedback Nash equilibrium strategies of the game (3.17)-(3.18) can be obtained as:
ekn
ba
= and
rf (ekp ]; t,x) = agpu+YP KphU A ^p ]; t), for i £ N and t £ [tk ,tk+1);
gpp gpp j
j=1
where 8^ is the short form for 8kal^1'ai)(2'a2)-(k-1'ak-l)].
Though continual adoption of non-climate-preserving technologies would lead to further irreversible climate deterioration, nations have no incentive to switch to climate-preserving technologies while other nations are using non-climate-preserving technologies. A global ban on non-climate-preserving technologies would unlikely receive unanimous approval because some nations may face higher production cost differentials in switching to climate-preserving technologies than others'. Only through cooperation and proper appropriation of gains could the problem be tackled.
4. Cooperative Arrangements in Climate Change Control
Now consider the case when all the nations want to cooperate and agree to act so that an international optimum could be achieved. Cooperation will cease if any of the nations refuses to act accordingly at any time within the game horizon. An agreement on the choice of technologies, taxes imposed, abatement efforts and an optimality principle to allocate the cooperative payoff will be sought. For the cooperative scheme to be upheld throughout the game horizon both group rationality and individual rationality are required to be satisfied at any time. Group optimality ensures that all potential gains from cooperation are captured. Failure to fulfill group optimality leads to condition where the participants prefer to deviate from the agreed upon solution plan in order to extract the unexploited gains. Individual rationality is required to hold so that the payoff allocated to a nation under cooperation will be no less than its noncooperative payoff. Failure to guarantee individual rationality leads to the condition where the concerned participants would reject the agreed upon solution plan and play noncooperatively.
4.1. Group Optimality and Cooperative State Trajectory
Given that there are two technologies available, the nations have a technology choice. Consider first the case when all nations agree to adopt climate-preserving technologies from time t0 to time T. If such technologies were used, the climate will be preserved as 8° throughout the game duration. To secure group optimality the participating nations seek to maximize their joint expected payoff by solving the following control problem:
Vl,V2,
max
,vn; Ul ,U2,
E
K -E +E j vh(s))](a
to
j=1
h=1
K +E ^ Vh(s)) h=l
-cK[â" + E ßh Vh(s)] - caK k(s)]2 - hK x(s) -
h=1
3-r(s-to) ds
o
0
o
£
rt
n 1
+ E 9k [XK0o - x(T)]e-r(T(4.1)
k=1 '
subject to
n n n
X(s) = E a0° +E ßh vh(s)]-E b0°Uj(s)[x(s)]1/2-Ö00 x(s),x(to) = xto. (4.2)
j=i h=i j=i
Invoking Bellman's (1957) technique of dynamic programming a set of controls
{$0 go
[v**(t),u*i*(t)] = (t,x),^i0 (t,x)], for i e N} constitutes an optimal solution to the control problem (4.1) and (4.2) if there exists continuously differentiable function W(to)(8°; t,x):[t0,T]xR^ R, i e N, satisfying the following partial differential equations:
-Wt(to)(8°; t,x)=
max
E
K = 1
[aK -12 j$ + E $ vh)](aK + E $ vh) j=1 h=1 h=1
h=1
+ E k Vh] - c>J]2 - hKx -
h=1
-r(t-to)
+Wito)(80; t,x) | E a*0 [cj + E $ Vh] - E bju}(x)1/2 - St
Kj=1
h=1
h vh ] ^ »j j
j=1
0x
n
W(to) (8°; T, x) = £ gKo [xe% - x(T)]e-r(T-to).
K = 1
(4.3)
Performing the indicated maximization in (4.3) yields the optimal controls under cooperation as:
b$0
vj i°{t,x) = —l—Wp\d°0;t,x)er<-t-t^x1/2, for ¡6JV; (4.4)
2d
n i- / n n \
E a K ^ j aj + E $ tf (t,x)] A
K = 1 L ^ j=1 h =1 '
nn
-E j$][aK + E $ tf (t, x)] - $ kk
j=1 h = 1
+Wito)(8°;t,x)er(t-to) ^ aj$ = 0, for i e N.
j=1
(4.5)
System (4.5) can be viewed as a set of equations linear in
{pg0 (t,x),p2° (t,x), ■■■ (t,x)} with Wx°\80; t, x)er(t-to) being taken as a parameter. Solving (??) yields:
tf0 (t,x) = ago + W(t0 )(8°; t,x)er(t-to)
(4.6)
where a$o and $g o, for i e N, are constants involving the parameters in (4.5).
K
e
rt
0
Proposition 4.1. System (4-3) admits a solution
W(t0 )(û°; t,x) = [Ago (t)x + Cgo(t)] e-
r(t-t ),
with
(4.7)
n (b"o)2 n
<o (t) = (r + Seo ) AÎO (t) - £ K-LL № (i)]2 + £ h°
j=i j C*go(t) = rCUt)
j=1
K=1
- £ ( aK - £ ßK{a + £ j [aho + ßho Ago (t)]} I {
j=i h=i
+ £ k aho + ß^ho A"o(t)]} - + £ j ao + ßjjo A"o(t)]} - eK
h = 1
j=1
-A" (t)
Yaï {O + £ j Oh* + ßt«oA;«o (t)]}
L j=i
h =1
Aeo(T) nj=i geo and Cg(T) n=i 9Jjox"o -
■j=iy " o-
Proof. See Appendix 2.
□
Using (4.4), (4.6) and (4.7), the control strategy under cooperation can be obtained as:
be
^(t,x) = alo+lSloA*eo(t) and (t, x) = -^A*go (t)x^, (4.8)
for t € [t0 < T] and i =1, 2, ■ ■ ■ ,n.
Substituting the optimal control strategy from (4.8) into (4.2) yields the dynamics of pollution accumulation under cooperation as:
x(s) = £ aj [ j + £ ß[ (Oho + ßh° A"o (s))] j=1 h=1
n (b° )2
+ £^-Ae0°(s)x(s) ~S8°x(s)> x(t0)=xto. (4.9)
2ca " j=i j
(4.9) is a linear differential equation with time varying coefficients. We use [x*(s)}s=t to denote the solution path satisfying (4.9). The term x* is used interchangeably with x * (t).
A remark that will be utilized in subsequent analysis is given below.
Remark 4.1. Let W(т; t, x) denote the value function of the control problem with objective (4.1) and dynamics (2.8) which starts at time т. One can readily verify that W(т)(<9g;t,x)=W)(0g;t,x)er(T-to), for т G [t0,Tj.
Now consider the case when existing technologies are adopted throughout the cooperative scheme. To secure group optimality the participating nations seek to maximize their joint expected payoff by solving the following control problem:
max
n
E
ç=i ;
[a - E ft+ Eßh vh(s))](aç + Eßl vh(s))
j=1 h=1 h=1
-cç a + E ßh Vh (s)] - c^ (s)]2 - hf0 x(s) - e $
h=1
e-r{s-;0) ds
П p П1 V2(1,ai) Vk[(1,a1)(2,a2)...(k-1,ak_1)]
,a0] ST- ^2[1,ai] ...
ak = 1
+EE EAai°,a0] E ^
ç=1 k=1 ai = 1 a2 = 1
-xk[(1,ai)(2,a2)...(k-1,ak_i)]
;k+i x j a
;k
/ г n n n
^ßj (aj + E ßh Vh(s))](aç + E ßh Vh(s))
t L j=1 h=1 h=1
-cka + Eßh Vh(s)] - cau(s)]2
h=1
gk[(i,ai) (2,a2)...(k-i,ak_i)]
-hçak x(s) - e
k[(i,ai) (2,a2)...(k-i,ak_i)]
3-r(s-;o)ds
k
Ç
n Vi V2(i,ai) VT[(i,ai)(2,a2)...(p,ap )]
+ ^ X1al°,a°] >2[1,ai] ... XT[(l,ai)(2,a2)...(p,ap)]
ç =1 ai =1 a2 = 1 a[ =1
dl T [(i,ai) (2,a2)...(p,ap )] X T [(i,ai) (2^2) ...(p,ap )] aT aT
- x(T)]e-r(T-t'o) for i G N. (4.10)
subject to (2.10).
Following the analysis leading to Propositions 3.1, 3.2, 3.3 and 4.1, one can obtain:
Proposition 4.2. For the case where gak(1'ai)('2'a2)'"(k ^^^ occurs in time interval [tk,tk+i ) at time ta for k G {0,1,2, ■ ■■ ,t}, the value function W(tk)(eak(1'ai)(2'a2)-(k-1'ak-l)]; t,x) can be expressed as:
W(tk)(ßk[(i,ai)(2,a2)...(k-1,ak-i)] ; t, x) =
[Afc(0fc[(1>ai ) (2,a2)...(k-1,ak-i)\ ; t)x + Ck (ßk[{^,ai) (2a) ■■■(k-1,aa-i)\ ; t)] e-r(t-ta)
ak ; ak ; (4'il)
where Ak(eW'ai)(2'a2)---(k-1'aa-i); t) and Ck(6m,ai)(2,a2)-(k-1,aa-i); t), for i e N, satisfy the following sets of differential equations:
"k[ ]
n (b"ak )2 n "k[ ]
Ak(0; t) = (r + s kl, ) Ak (ell] ; t) - E H^r- ^ ] ; i))]2 + E Vfc -
ak j=i 4cj j=i
Ck (e^; t) = rCk (e^; t)
-Y (a -Ëßç{O + £ßh [% ]+ ß^, ]Ak(ek[];t)]} j=1 h =1 "a "a
n \
O + E ßh [Ohk[• ]+ ßhek[.]Ak(ek[]; t)]
n "a[ ]
-CÇ{O - £ßÇ [j.] + j•.]Ak(ek[];t)]}-E"a
-Ak(ek N; t)
"ç ] L aa[• ] 1 t aa
"a "a
j=1kk
"ak[ ]
a {aj + £ ßh [&hgai.] + ßhea[.]Ak(eki];t)]}
j=1 j h=1 h "a "a a
Va +1[(1,ai)(2,a2)...( k ,a a )]
Ak (ek [•]• T ) = £ Ak+1[(1,ai)(2,a2)...(k,a a )]
a a+i
xA^^ek+m,^ )(2,a2)-(k,a a )] ; tk+i,x),
Va +i[(i,ai)(2,a2)...( a ,a a )]
Ck (ekH - T) = £ Ak+1[(1,ai)(2,a2)---(k,aa)]
a +i
xC k+1(ek++1^(1'ai )(2,a2)-(k,a a )]; tk+1,x); where e^J is the short form for ek[(1'ai')(2'a2)''(k 1,aa-i)], and
Ar+^er+m^vMr,^)]; tT+i,x) = -Y, g"
ç=1 "a T
T [(1 ,a 1) (2,a2)...(T,aT )] ,
CT+1(e^ai)(2'a2)-(T'aT)]; tT+i,x) = £ g^
JaT + 1 ; tT +1'x) = 7, g"T [(1,ai)(2,a2)...(T,aT )]
ç=1 aT
V"T [(1 ,a 1 ) (2,a2)...(T,aT )] ' "aT
□
In the case where 8tk(1'a1)2'a2)"(k 1,ak-1)] occurs in time interval [tk,tk+i) at time tk for k € {0,1, 2, ■ ■ ■ ,t}, the optimal control strategy under cooperation can be obtained as:
№
tfiO^x) = -L-A^-ttf/2, and
$(8^;t,x) = «3* + t), for i € N and t € [tk,tk+i); (4.12)
°ak °ak
where 8^} is the short form for 8a](1'ai^(2'a2)---(k 1,ak-1)], and a® kN and /kN are
®ak ®ak
the counterparts of ttlo and /3®o in (4.8).
"o ®0
Substituting the optimal control strategy in (4.12) into (2.10) yields the cooperative state trajectory using existing technologies as:
n 1 n
f)k[-\ _ ~
±(S) = E j {«j + E j[%, + aVA(8kk];«)]} j=i h=i ak ak
gk[■] n j )2
+ V]^)' x(tk) = xtkeX, (4.13)
. . 2c«
for s G [tk,tk+i) and k €{1, 2, • • • ,p}.
(4.13) is a linear differential equation with time varying coefficients. We use
9k[(l,a1)(2,a2)...(k-l,ak-1)] ]
x°ak (s) ^ to denote the solution path satisfying (4.13).
sE[tk,tk+l)
(2,a2)...
k[(1,ai) (2,a2)...(k-1,ak-l)]
-k[(1,ai) (2,a2)...(k-1,ak-1)]
The term XUak (t) is used interchangeably with
ak
xt
A remark that will be utilized in subsequent analysis is given below.
Remark 4.2. Let Wk(T)(2'a2)-(k-1'ak-l); t,x) denote the value function of the control problem with objective (4.10) and dynamics (2.10) which starts at time t. One can readily verify that
WWk(T) (8a^í1'a1)í2'a2)■■■{k-1'ak-1)].; t, x) = WW (tk ) (88k^1'a1) i2'a2)---(k-1'ak-1)] ; t, X)er(T tk) .
Climate-preserving technologies will be adopted if the total cooperative gain using these technologies is larger than the expected total cooperative gain using existing technologies, that is W(to)(80; t0,x0)> W(to)(80; t0,x0). Moreover, if along the optimal path {x*(s)}J=to, W (tk)(80; tk,x*k )> W(tk) (80; tk ,x*tk), for all k € {0,1, 2, ■ ■ ■ ,p}, climate-preserving technologies will be adopted throughout the duration [t0,T]. We first consider the case when climate-preserving technologies are chosen throughout the cooperation duration. In Section 6 the case of partial adoption of climate-preserving technologies will be examined.
4.2. Individually Rational and Dynamically Consistent Imputation
An agreed upon optimality principle must be sought to allocate the cooperative payoff. For t € \tk,tk+1), let )j(80; t,xT) denote the solution imputation (payoff under cooperation) over the period [tk,tk+1) to player i € N as viewed at time t. In
a dynamic framework individual rationality has to be maintained at every instant of time within the cooperative duration [t0,T] along the cooperative trajectory {x*(s)}s=t . Individual rationality along the cooperative trajectory requires:
e(T)i(o0°; T, x*T) > yk(T)i(0°; T,x*r),
for i € N, t € [tk,tk+1) and k €{0,1, 2, ■ ■ ■ ,p}, (4.14)
along the optimal trajectory {x* (s)}'T=t0.
Since nations are asymmetric and the number of nations may be large, a reasonable solution optimality principle for gain distribution is to share the gain from cooperation proportional to the nations' relative sizes of expected noncooperative payoffs. To ensure that the cooperative solution is dynamically consistent, the condition of time consistency has to hold. Time consistency requires the solution opti-mality principle determined at the outset to remain effective at any instant of time throughout the game along the optimal state trajectory. Since all the participants are guided by the same optimality principle at each instant of time, they do not have incentives to deviate from the previously adopted optimal behavior throughout the game. Thus the optimality principle governing the agreed-upon imputation must be maintained throughout the cooperation period to secure time-consistency.
Hence the solution imputation scheme £k(T)i(0°; T,xT), for i € N and k € {0,1, 2, ■ ■ ■ ,p}, has to satisfy: Condition 4.1.
= (4.15)
for i € N and t € [tk,tk+1) and k € {0,1, 2, ■ ■ ■ ,p}, along the optimal path fT
it=t o .
{xT}Tr=t,o . □
One can easily verify that the imputation scheme in Condition 4.1 satisfies individual rationality. Crucial to the analysis is the formulation of a payment distribution mechanism that would lead to the realization of Condition 4.1. This will be done in the next Section.
5. Payment Distribution Mechanism
To formulate a payment distribution scheme over time so that the agreed upon imputation (4.15) can be realized for any time instant t € [t°,T] we apply the techniques developed by Yeung and Petrosyan (2004 and 2006b). Let the vector
Be<0(s,x*s) = [B0 (s,xS ),B20 (s,xS ),■ ■ ■ ,B6r0 (s,xS)] denote the instantaneous payment to the n nations at time instant s when the state is xS for s € [t°,T]. A
terminal value of g\0 [xi„0 — xT] will be offered to nation i at time T.
U0 U0 1
To satisfy Condition 4.1 it is required that
= m**(r),(*oV,*;) W(r){0o.T>a;)
E yk(T)i(0°°; T,xT) j=i
j Bf0 (s,x* (s))e-r(s-T)ds + glo [xf0 - x*T]e
-r(T-t )
for i € N and T € [tk,tk+1) and k €{0,1,2, ■ ■ ■ ,p}. (5.1)
To facilitate further exposition, we use the term £k(T)i(0°; t,x*) which equals
J Bf0 (s,x* (s))e-r(s-T)ds + glo [xfo - x*T]e
-r(T-T )
E Vk(T)l (0°; t,x* )
j=i
yWfâ]-t,x*t)
n
E Vk(t)i(0°; t,x*) j=i
-W (t)(0°0; t,x * )e-r(t-T ) = £k(t)i(0°0; t,x * )e-r(t-T ),
for i G N and t G [tk,tk+1) and t G [T,tk+1),
(5.2)
to denote the present value (with initial time set at t) of nation i's cooperative payoff over the time interval [t, T].
Theorem 5.1. A distribution scheme with a terminal payment gi0 [x\0 — xT ] at
U 0 U 0 1
time T and an instantaneous payment at time t € \tk,tk+i) equaling
Bf0 (t, x. dT )l(o0; t,x* )
ik{T ]i (o0; t,x* )
t=
n n
y,4 [«j +E j (t,xt )]
l j=i
h=1
+ e bf00 (t,xt)(xT)1/2 - s6
j=i
for i G N and t G [tk,tk+i) and k G{0, 1,2,- • • ,p}, (5.3)
yield Condition 4.1.
Proof. Since £k(T)i(B'0;t,x*) is continuously differentiable in t and x*, using (5.2) and Remarks 3.1 and 4.1 one can obtain:
T+At
J Bf0 (s,x* (s))e-r(s-T)ds
Çk(T)i(00°; t, xT) - e-rAttk(T+At)i(00°; t + At, xT+At)
t=T
x
= )i(000; T, xT) - e(T)i(000; r + At, x*T+At),
for i G N andr G [tk,tk+i) and (r + At) G [tfc,tk+1), where
(5.4)
AxT
L j=i
[«j + E j tf (T,XT )] + E j ^ (t,xt )(xT )1/2 - Ö,
h=1
j=1
At
+o(At),
where o(At)/At ^ 0 as At ^ 0.
With At ^ 0, condition (5.4) can be expressed as:
b:0(r,xT)At+o(At) = - et{T^K;t,x*t)
At
dT }i(û0; t,xt )
[«j + E j (t,xt )]
L j=1
h=1
+ E j ™:0 (r,xT )(xT )1/2 - 0 xT
j=i
At,
(5.5)
Dividing (5.6) throughout by At, with At ^ 0, yields (5.3). Hence Theorem 5.1 follows. □
Theorem 5.1 provides a payoff distribution procedure leading to the satisfaction of Condition 4.1 and hence a dynamically consistent solution will be obtained. When all nations are adopting the cooperative strategies the rate of instantaneous payment that nation k G N will realize at time t with the state being Xt can be expressed as:
0 0
h (t xt
x
t=T
t=T
^ - £ßh{&j + E ßh [Âh„ + ßh0 At0(r)]} I {âh + E ßh [âh„ + ßh0 At0(r)]}
j=1 h = 1 / h = 1
00 a( ¿h"
--cK{âK + ^ft Ko + ßlo A;oo(t)]} - - c» ] X* - Cxî . (5.6)
Since according to Theorem 5.1 under the cooperative scheme an instantaneous
o
payment to nation k equaling bko (t,x*t ) at time t with the state being xT, a side
go go
payment of the value bko (t,x*t)—kko (t,x*t) will be offered to nation k.
2
6. Partial Adoption of Climate-preserving Technologies
In this section, we examine the case where climate-preserving technologies are not adopted throughout the duration [t°,T]. Consider the situation when along the optimal path {x* (s)}T=t0, W(tk)(0°°; tk,x*J> W(tk)(0°; tk,x*k), for k € {0,1, 2, ■ ■ ■ ,(}. At time tz, W (t< )(0°; tz ,x*( )> W (t< >(0°; tz ,x*(). This would induce the nations to switch back to non-climate-preserving technologies.
For notational convenience, we denote 0^=0°, for k €{0,1, 2, ■ ■ ■ ,p} whenever applies. At time tz the climate condition can be expressed as 0Z[(1'1) (2,1)...(z 1,1)1. In the time interval [th, th+1), for h € {Z +1, Z+2, ■ ■ ■ , p}, the random variable representing the climate condition can be expressed as ,,fe[(1,1)(2,1)...(C,1)(C+1,ac+i)-(fe-1,ah_i)] 0ah .
The imputations |"(t^Md^ (2,1)...(C-1,1)-(h-1,ah_i)]; t,
gh[(1,1) (2,1)...((_1,1) • • • (h_1,*h_l)] xTah ), for i € N and t € [th,th+1) and h € {Z, Z +1, ■ ■ ■ , p},
which share the gain from cooperation proportional to the nations' relative sizes of expected noncooperative payoffs, require the following condition to hold.
Condition 6.1.
= for r€[ic>ic+1),
0
EyZ(T )j (0°; T,xT0)
0 j=1
(6.1)
and
Th[ • ] y h(T )i(0h['] . T x Thh ) Th[ • ]
r, ) = mv {dah^XT X ), (6.2)
E yh(T)j (0hh ]; T,Xrh ) j=1
f M-] 0h[(1,1)(2,1)...(Z,1)(Z+1,at+1)-(h-1,ah_1)] ah ah
€ {0h[(1,1)(2,1)...(C,1)(C+1,aC+1)---(h-1,ah_1)] 0h[(1,1) (2,1)...(Z,1)(Z+1,a(+1)-(h-1,ah_1)]
0h[(1,1)(2,1)...(Z,1)(Z+1,a<+1)-(h-1,ah_1)]l
, Urlh[(1,1) (2,1)...((,1)((+1,a(+1) • • • (h_1,ah_1) },
Z+1) • • • (h_1,ah_
( Th[ • ] 1 th+1
i € N and t € [th , th+1), along the optimal path < xTah > and h € {Z +
T=th
1,Z + 2, ■ ■ ■ ,p}.
Following the analysis leading Theorem 5.1, a dynamically consistent distribution scheme can be obtained as:
Theorem 6.1. A distribution scheme with a terminal payment
gi T [(1,1) (2,1)...(C,1)(C+1,ac+1) • • • (p,ap)] [xl T [(1,1) (2,1) ... (Z, 1)(Z + 1 ,af + 1 ) • • •(pap)] — x(T)] if 1 1
0TT(i,1)(2,1)...(Z,1)(Z+1,ac+1) (p,ap)] occurs at time T and an instantaneous pay-
ment at time t € [tz ,tz+1) equaling
Bz(t»K; t,X60 ) = - ]tK, T,&)
f£T ^0 ; t,xT0 )
Yj [Ôp + E j iï(e0; t,xT0 )]
L j=i
h=1 60
+ E j$(0O; T,z6r°)(xT0)1/2 - sg0 x j=i
at time t e [th,th+i) and h e{ + 1,C + 2, ■ ■ ■ , p}
and
B h(T )i(e^; T,x
)h[•]
lh(T )i(0h[t}; TX
eh[-] 6 ah )
íhT W; T,xTah )
6h[:].
6 h[;]
eh[;].
Yj [j + E j (öhh']; T,Xrah )]
L j=i
n nh[ • ] Hh[. ] Hh[. ]
+E jh jh]; t,£ h h )1/2 - st j=i
6 h[ ]
hh[• ] XT h
for i e N,
where 0hah]^ ehh(1,1)(2,1)-(z,1)(z+1,az+i)-(h-1,ah-i)] ah ah i
yield Condition 6.1. Proof. Follow the Proof of Theorem 5.1.
(6.3)
□
Theorem 6.1 provides a payoff distribution procedure leading to the satisfaction of Condition 6.1 and hence a dynamically consistent solution will be obtained. Though the original climate condition could not be preserved with partial adoption of climate-preserving technologies, the expected climate deterioration at terminal T is less than that under a noncooperative equilibrium.
7. Concluding Remarks
In this analysis, we present a differential game of pollution management with climate change. Though continual adoption of non-climate-preserving technologies would lead to further irreversible climate deterioration, nations would not to switch to climate-preserving technologies while other nations are using non-climate-preserving technologies. A global ban on non-climate-preserving technologies would unlikely receive unanimous approval because some nations with higher production cost differentials after switching to climate-preserving technologies would become worse off. Through cooperation under which nations would jointly adopt climate-preserving technologies and share the gains in an acceptable scheme could be halted.
In dynamic cooperation, a credible cooperative agreement has to be dynamically consistent. For dynamic consistency to hold the specific optimality principle must remain in effect at any instant of time throughout the game along the optimal state trajectory chosen at the outset. In this paper, dynamically consistent cooperative solutions and analytically tractable payoff distribution procedures are derived.
t=T-
t=T -
t=T -
t=T-
Moreover, the solutions are obtained in explicit closed-form so one can calculate the intended results with given parametric values. This approach widens the application of cooperative differential game theory to environmental problems where climate change occurs. Since this is the first time cooperative differential games are applied in climate change control, further research along this line is expected.
Appendix 1. Proof of Proposition 3.1.
Using (3.4) and (3.5), system (3.3) can be expressed as:
r[Ap(P; t)x + Cp(0PM; t)] - [Alpt)x + Cp(OpM; t)]
n n _ n r-|
- £ ßj{al + £ ßl [&hoal•• + £ ß^k K(O^;t)]}
j=i
h = 1
ap k=1 ap
n
n
a' + E Ml Oyi• i + E ßha• ]k Ak (0P t)] ap ap
h=1 k=1
-ci{ai + Eßj [ajea, i + Eßl Apik(0pp]; t)]}
j=i
k=1
0P[-]
_ra
' 2ca iV a
]; t)]2x - hp x - Ep
+Af<]; t)
l ajap {aj+£ n=i ßi aePl • •
ap[• •
\ i ep[• • hap t i
and (A.1)
nT [(l,a1)(2,a2)...(P,ap )1
Ap(0p^; T)x + Cp(ep[}; T) = ]T ^i^)^)...^)]
aT=1
X9„ T[(1,ai) (2,a2)...(P,ap )1 [xo T [(1,ai)(2,a2)...(P,ap )]
- x(T)]e-r(T-tp);
for i £ N.
For (A.1) and (A.2) to hold, it is required that
(A.2)
a
T
T
t) = (r + S a • i)AT (OO-P t) -
];t) E
j = i
j =i
gp[ • 1
PZ1
2c«
Ap (Op; t)
];
aP[• • r i
ilPap \2 n 0P[ • •
AP (OP; T )= -
VT [(1,ai)(2,a2)...(P,ap )
£ A,
aT=1
T [(1,ai)(2,a2)...( p ,ap)]
(A.3)
9„ T [(1,ai) (2,a2)...(P,ap)1 '; (A.4)
T
Consistent Solution for a Cooperative Differentia,l Game and Cp(epa[p]; t)=
n n n
rCp(0^; t) -( ai j{aj + £ 4h [a^p,]+£^ Ak(0opM; t)]} ) ^ j=1 h=1 k=1 '
n
a + E4h [a hePj.] + £-lk Ak(oppi;t)] h=1 P k=1 P
n
n
+Ci{ai -J24 [afp[-i + E jj p.ik Ap(opp;t)]} + £.
j=1
k=1
-Ap(0P[p]; t)
n fP[-. n _ . n r 1
n {aj + n 4 [ahp- 1 + n ^-U Ak(0pp];t)]}
L j=1
Cp (op P ; T )
h =1 k=1 (2,a2
E AT
[(l,a1)(2,a2)...(p,ap )]
\T [(1,a1 ) (2,a2)...( p ,ap)\
aT = 1
Xg„T [(1,ai)(2,a2)...(p,ap )] x T [(1,oi) (2,0.2) ■■■ (POp )1 .
faT faT
Hence Proposition 3.1 follows. Q.E.D. Appendix 2. Proof of Proposition 4.1.
Substituting (4.4) and (4.6) into (4.3) and using (4.7) one obtains: r[AU (t)x + C'L (t)] - [A*ea (t)x + C'L (t)] =
(A.5)
(A.6)
K=1
E I aK ^ 4K{aj + E j [âh00 + jh0 A*60 (t)]} j=1 h =1
n
x{âK + £ j [<& + (t)]}
h=1
-Âk{Âk + E j [<Â j=1
f
+ f?B% A*go (i)]} - C% [^A* „ (t)fx - ee! - heIx
j
+A*g0 (t)
£ af{cP + + k Ak t*)» + E HJrAlo (t)x - Seox
L j=1
h =1
1j
[Af 0 (T)x + C*0 (T)]= E g^c [x«60o - x(T)].
For (A.7) and (A.8) to hold, it is required that
/>0
n (o 0 )2 n a0
A;0(t)=(r + Sdo) A*eo(t)~ E E h*°
0 00 '17 0 • -i
3 = 1 7 3 = 1
n
Ah (T )=- E gj
j=1
j
;
0
(A.7) (A.8)
(A.9) (A.10)
ch (t) = C (t)
-Y
K=1
j=i
j{ô? + E j [a
ào + P^jo AnO
^ Bn Bn
(t)]}
h =1
x{aK + £ PK [ajo + Pjo A*Bo0 (t)]} - cK {aK + £ j [j + Pjo A*Boo (t)]} - eK
h = 1
j=1
-AB o (t)
n a o B
£ «Bo{aj + ,£ /?h [ajo + jjo ABo(t)]}
L j=i
h=1
c*eo(T gB o xB o •
Bo j=i Bo Bo
(A.11) (A.12)
Hence Proposition 4.1 follows. Q.E.D. Acknowledgements
This research was supported by the European Union TOCSIN Research Grant and the HKSYU Research Grant.
References
Allen, M., Stoott, P. A., Mitchell, J. F. B, Schnur, R. & Delworth, T. L. (2000). Quantifying The Uncertainty in Forecasts of Anthropogenic Climate Change. Nature, 407, 617-620.
Bellman, R. (1957). Dynamic Programming. Princeton, NJ: Princeton University Press.
Berliner, L. M. (2003). Uncertainty and Climate Change. Statistical Science, 18(4), 430435.
Breton, M., Zaccour, G., & Zahaf, M. (2005). A Differential Game Of Joint Implementation Of Environmental Projects. Automatica, 41, 1737-1749.
Breton, M., Zaccour, G., & Zahaf, M. (2006). A Game-Theoretic Formulation of Joint Implementation of Environmental Projects. European Journal Of Operational Research, 168, 221-239.
Dixit, A.K. (1979). A Model of Duopoly Suggesting A Theory of Entry Barriers. Bell Journal of Economics, 10, 20-32.
Dockner, E.J. & Leitmann, G. (2001). Coordinate Transformation And Derivation of Open-Loop Nash Equilibria. Journal of Economic Dynamics and Control, 110, 1-15.
Dockner, E.J. & Long, N.V. (1993). International Pollution Control: Cooperative Versus Noncooperative Strategies. Journal of Environmental Economics and Management, 25, 13-29.
Feenstra, T., Kort, P.M., & De Zeeuw, A. (2001). Environmental Policy Instruments in an International Duopoly With Feedback Investment Strategies. Journal of Economics Dynamics & Control, 25, 1665-1687.
Fredj, K., Martn-Herrn, G. & Zaccour, G. (2004). Slowing Deforestation Pace Through Subsidies: A Differential Game. Automatica, 40, 301-309.
Isaacs, R. (1965). Differential Games. New York, NY: Wiley.
J0rgensen, S. & Zaccour, G. (2001). Time Consistent Side Payments in a Dynamic Game of Downstream Pollution. Journal of Economic Dynamics and Control, 25, 1973-1987.
Nash, J. F. (1951). Non-Cooperative Games. Annals of Mathematics, 54, 286-295.
Petrosyan, L. & Zaccour, G. (2003). Time-Consistent Shapley Value Allocation of Pollution Cost Reduction. Journal of Economic Dynamics and Control, 27, 381-398.
Singh, N. & Vives, X. (1984). Price and Quantity Competition in a Differentiated Duopoly. Rand Journal of Economics, 15, 546-554.
Stimming, M. (1999). Capital Accumulation Subject to Pollution Control: Open-Loop Versus Feedback Investment Strategies. Annals of Operations Research, 88, 309-336.
Tahvonen, O. (1994). Carbon Dioxide Abatement as a Differential Game. European Journal of Political Economy, 10, 685-705.
Yeung, D.W.K. (1992). A Differential Game of Industrial Pollution Management. Annals of Operations Research, 37, 297-311.
Yeung, D.W.K. (2007). Dynamically Consistent Cooperative Solution in a Differential Game of Transboundary Industrial Pollution. Journal of Optimization Theory and Applications, 134(1), 143-160.
Yeung, D.W.K. & Petrosyan, L. (2004). Subgame Consistent Cooperative Solutions in Stochastic Differential Games. Journal of Optimization Theory and Applications, 120, 651-666.
Yeung, D.W.K. & Petrosyan, L. (2005). Subgame Consistent Solution of a Cooperative Stochastic Differential Game With Nontransferable Payoffs. Journal of Optimization Theory and Applications, 124(3), 701-724.
Yeung, D.W.K. & Petrosyan, L. (2006a). Dynamically Stable Corporate Joint Ventures. Automatica, 42, 365-370.
Yeung, D.W.K. & Petrosyan, L. (2006b). Cooperative Stochastic Differential Games. New York, NY: Springer-Verlag.
Yeung, D.W.K. & Petrosyan, L. (2008). A Cooperative Stochastic Differential Game of Transboundary Industrial Pollution. Automatica, 44, 1532-1544.