ANALYTICAL ESTIMATION OF RELIABILITY ОF NETWORKS CONSISTING OF
IDENTICAL ELEMENTS.
O. A. Tkachev •
Moscow Aviation Institute, Moscow, 125993, GSP-3, A-80, Volokolamskoye shosse 4, Russia. e-mail: [email protected]
ABSTRACT
In this work the analytical expressions for reliability factor of steady-state network consisting of identical elements are accomplished. For obtaining of the analytical expressions of considered factors, the Markov process characterized by a certain combination of failed elements and by a network state is considered. It is shown that for definition of values of the obtained expressions it is sufficient to define the number of combinations of set capacity elements at which failure the network passes in non-operable state. For small dimension networks this factor may be defined precisely by analysis of all possible combinations of failed elements. For large-scale networks it is necessary to use Monte Carlo method or combined method.
1 INTRODUCTION
At network reliability assessment, reliability factors which define dynamics of behavior of the network are of great interest. Following are considered as reliability factors: mean time spent in operable and non-operable conditions within the steady-state mode. The simplest criterion for network operability is its connectivity. The network is considered to be connected if between any pair of nodes exists at least one path.
In the course of deducing inference, we will use connectivity of a network as criterion of its operability. In section 5 the possibility of application of the obtained expressions at use of other criteria of a network operability is justified.
If connectivity is used as a network operability criterion, than mean time spent in the connected state corresponds to mean time to failure (MTTF), and network mean time spent in the disconnected state corresponds to mean time to repair (MTTR). Therefore MTTF and MTTR may be considered instead of mentioned reliability factors.
2 NETWORK RELIABILITY MODEL AND OBTAINING OF ANALYTICAL EXPRESSIONS
Let us assume that network nodes being absolutely reliable and edges being identical on reliability fail independently from each other and their time to failure and time to repair are exponentially distributed with parameters X and p.
Let us imply moving off of an edge from the network till its repair to be edge failure. Distinctive feature of networks which complicates definition of reliability factors is that at certain number of failed edges network may be connected as well as disconnected. For example, network presented on Figure 1 may not be precisely defined as connected then two or three edges are removed.
Figure 1: Example of the network
The cuts of capacity i is employed as a key parameter which allows us to define considered reliability factors of the network. Let us denote this parameter by Yi.
Let's define Yi for the network presented on Figure 1. Since considered network is biconnected, therefore one edge moving off cannot break its connectivity. Hence Yi=0. For definition of Y2 and Y3 we consider all possible states of the network when 2 and 3 edges are moved off respectively, Figure 2.
Analyzing the data presented on Figure 2, it is possible to define that 2 cuts of capacity 2, and 14 cuts of capacity 3 are within the considered network, hence y2=2, Y3=14. Any
combination of i edges at i > 3 is a cut, hence Yi =
V i y
at i > 4.
At small n, values of Y, may be defined by exhaustion of all possible states of the network. At great values of n, place Monte Carlo method or combined method should be used.
( A } l.k' ¡=2 \ J Connected states i=3 l.k'
2,1' 1.2 -- 1.2.3 3.1'
2.2' 2.3' 1.3 1.4 V 1.2.4 1.2.7 3.2' 3.3'
2.4' 1.6 1.3.6 3.4'
2.5' 1.7 1.3.7 3.5'
2.6' 2.3 1.4.6 3.6'
2.7' 2.4 1.4.7 3.7'
2.8' 2.5 1.6.7 3.8'
2.9' 2.6 2.3.5 3.9'
2.10' 2.7 2.3.6 3.10'
2.11' 3.5 2.4.5 3.11'
2.12' 3.6 2.4.6 3.12'
2.13. 3.7 2.5.7 3.13'
2.14' 4,5 2.6.7 3.14'
2.15' 4,6 3,5,6 3,15'
2.16' 4,7 3.5,7 3,16'
2,17' 5,6 3.6.7 3,17'
2,18' 5,7 4.5.6 3,18'
2,19' 6,7 4.5.7 3,19'
4.6.7 3,20'
5,6,7 3,21'
l.k" l.k"
Disconnected states \\ \ ^^ CB)
1,2,5 3,1"
1,2,6 3,2"
1,3,4 3,3"
1,3,5 3.4"
1,4,5 3,5"
1.5,6 3,6"
1.5,7 3,7"
2.3,4 3.8"
2.3.7 3.9"
2.4.7 3,10"
2,5.6 3,11"
3,4,5 3,12"
2,1" 1,5 3,4,6 3,13"
2,2" 3,4 3,4,7 3,14"
Figure 2: Network state transition at failure of i edge (i=3)
Knowing Yi it is possible to define probability of i edges to become disconnected in case of failure. Let us denote this factor by - Z¡.
Value of Zj is equal to ratio of Yi to the total number of possible combinations of i of n elements where n is the number of edges in the network, expression (1).
' n *
ViJ
Consider a Markov process which describes network state transition at failure and repair of its edges. Each state is characterized by a certain combination of failed edges and by a network state.
Denoted by E+ is the set of operable states of the network, E_ is the set of non-operable states of the network.
Analytical expressions for definition of an average stay time of Markov and semi-Markov process in a subset of states have been obtained by Ushakov (1969a, 1969b).
For the considered case B.V.Gnedenko (1983), the average stay time of Markov process on a set of states E+ before the first entering to a state of set E_ can be defined from expression:
IP IP
TE, =^EWt = (2)
I (P ) I (Pj IX)
where Xtj - rate of transition from state i e e+ to state j e E_, Xji - rate of transition from state j e E_ to state i e e+ .
Similarly mean time for the process to be within the set of states E_ is defined
IP IP
TE = jeE- -= jeE- (3)
E- I (pj IX) I(PIX)
jeE_ ieE. ieE. jeE_
Let us denote connected states by (i, k) pair and disconnected network states by (i, k ) pair at i failed edges. Then the left parts of expressions (2) and (3) may be written as follows:
n
I I P,-
Te. = n" "E' (4)
I IPA
, i,k i ,k
i=° k eE.
n
I IP/
Te = --(5)
E
IIP M"
'k eE_
where Pik-, Pik- - - probability of states; and A*k- - rate of transition from state (i, k) to
states belonging to the set E_, j ,, - rate of transition from state (i, k ) to states which belong to
i,k
the set e+.
Summary edge failure rate at i state equals to (n - i)A, and summary rate to repair - ij therefore:
A , =(n - i)az * , (6)
(i,k ) (i,k)
ieE. jeE
jeE_ ieE.
* _ **
ß „ = ißZ ,,
(i,k ) r (i,k )
(7)
where Z*ik,) - probability of network transition to disconnected state from (i, k) in case of one edge failure;
Z**k,,) - probability of network transition to connected state from (i, k ) state in case of one edge repair.
For definition of TE and TE from expressions (4) and (5) it is necessary to know values
P , and P „ . Since all edges are identical on reliability therefore all states with i failed edges
(i,k ) (i,k )
are equiprobable.
The probability of i edges in the network to be failed is equal to
(W
P =
fn\
vi y
The total number of such states is
vi y
(1 + X/ß)n therefore
(8)
P , = P , = P = P - (Â/My
(ik ) (ik )
n
vi y
(1 + X/ß)n
(9)
Further, using Zi defining from expression (1), it is possible to define the number of disconnected states at i failed edges.
Y = Z
Similarly, the number of connected states
n
viy
(10)
n
vi y
- Y =(1 - Z. )
n
vi y
(11)
If we substitute (6), (7), (8), (10) (11) into (4) and (5), and placing variables independent of i and k outside summation symbols, we obtain
I(1 - Zi )
n
T =
-i T7
i =0
P
vi y
¿IP (n - i) z z . )
(12)
i=0
k EE
Z(Zi )
n
T=
E
i=0
P
vi y
(13)
i=0 k"eE_
Practical use of the obtained expressions is made difficult by necessity to consider all possible 2n states of the systems for definition of Z(* k,); Z(7k^.
In Tkachev (1999, 2010) it is shown, that defining of Zi value is sufficient to define the values of expressions (12) and (13).
+
If at i failed edges the network is in one of connected states, then at one of (n-i) operational edges failure the network may transit to one (n-i) states with i+1 number of the failed edges. Let
S , of them be disconnected, then from definition Z , follows that
(i,k) (i,k)
S ,
Z* = (',k ) (i k) ■ n -i
For example, Z \ =2/5 (point A on Figure 2).
Now, sum expression may be presented as Z
(14)
(i,k )
i z v = i
^ (i ,k ) ^
S
(i,k) _
C
k eE
k eE
n - i
n - i
(15)
where Ci - the total number of transitions from the set of connected states (i, k) to the set
of disconnected states (i +1, k ).
For definition of Ci value the following reasoning may be employed.
If the network is in one of Y disconnected states, then at failure of one of (n-i) operational edges the network may transit to another disconnected state with the number of failed edges i+1. The total number of transitions from disconnected states with the number of failed edges i to disconnected states with the number of failed edges i+1 equals to Yi (n - i). Then value of Yi+1 may
be expressed through Y and Ci as follows:
Y+1 =
Yi (n - i) + C
i +1
(16)
The denominator equals to i+1 because the network can transit in each state with the number of failed edges i+1 from i+1 states with the number of failed edges i (point B on Figure 2)
On the other hand, in compliance with definition
Y
Z = i+1 A!+1 f ' n
Vi + 1J
If we substitute Yi+j from (16) to (17) we obtain
Zi+1 =
Yt (n - i) + Ci
r n ^
(i +1)
Vi + 1J
In addition,
n
Vi + 1J
n
Vi J
n-i
i+1
(17)
(18)
(19)
If we substitute expressions (1°) and (19) to (18), then after simplification we obtain
Z
n
(n - i) + Ci
Zi+1 =
_ V J
n
(2°)
ViJ
(n - i)
whence it follows that
in )
Ct=(ZM - Zt) . (n - i) (21)
Therefore
vi j
C
Z 7*- = ~ = (Z,1 - 7t) . (22)
,. (i,k) n -i
v1J
Substituting (22) to (12), we obtain
Z(1 - 7)
n
P
= I ~-V1J ^ (23)
X^P, (n - i)(ZM - 7,)
v1j
After notation we obtain
n
R =(1 - 7) . P (24)
v1j
1 - Zt
Thus, expression (23) may be brought to the form
n
7 - 7
Z*= fiiL^ (25)
\ = - (26)
XZR (n -1)7*
i=0
The numerator of this expression is probability for the network to be in connected state at an arbitrary point of time. If we substitute R from (24) and P from (9) we obtain
1 " f n)
R = n V v Z(1 - 7,) . (XM)1 (27)
(1+XM)nU V i J
By similar reasoning it is possible to obtain expression for system's mean time spent in the disconnected state from (13).
If at i failed edges the network is in one of disconnected states, then at repair of one of i
edges the network may transit to one of i states with the number of failed edges i-1. Let S ,, of
(i ,k )
them be connected, then from 7**k„) definition follows that
S ,
7= _oil (28)
(i,k ) i
Now sum expression may be given: 7**k„).
S '' c"
Z 7 - = Z -JhLL = ^ (29)
^ (i,k ) ^ i i
k eE_ k eE_
where Ci - the total number of transitions from disconnected set (i, k ) to connected set (i -1, k).
For definition of Ci the following reasoning may be used.
If the network is in one of Y disconnected states, then at repair of one of i failed edges the network may transit to connected or disconnected state with the number of failed edges i-1. The total number of transitions from disconnected states with the number of failed edges i, is equal to
(Y)* i. Ci of them are transitions into connected states. Then value of Y-1 be expressed through Y and Ci as follows:
Y = Yi-Cl
'-1 n - (i -1)
(3°)
The denominator equals to n - (i -1) because the network can transit in each state with the number of failed edges i-1 from n - (i -1) states with the number of failed edges i. From expression (3°) we obtain
Ci' = (Y )i - Yi_1(n - (i -1))
(31)
Substituting Yi from (1°) in (31), we obtain
In addition,
C =
n
Z -
ViJ
n
Vi - 1J
Zi-1 (n - (i -1))
f n ^ (n\
Vi - 1J Vi J(n - (i -1))
(32)
(33)
Ci =
n
ViJ
i(Zi - Zt-1)
If we substitute Ci value from (34) into (29) and expression (29) into (13), we get
After notation
we obtain
I(Z)
n
t =
1E
P
Vi J
n
nIPi . (Zi - ZM)
VJ
a=zt
n
P
Vi J
(34)
(35)
(36)
where
T=
E
Ia
i=°
ßIiQr (1 - Z**)
i=°
Z
Z ** = i-1
Z,
(37)
(38)
Results of calculation of Yi, Zt,Z*,Z** values for the network on Figure 1 are presented in
Table1.
Table 1. Y, Z,Z*,Z** values for the network on Figure 1
/ f n ^ Y Z Z * **
0 1 0 0 0 0
1 7 0 0 0,095238 0
2 21 2 0,095238 0,336842 0
3 35 14 0,4 1 0,238095
4 35 35 1 0 0,4
5 21 21 1 0 1
6 7 7 1 0 1
7 1 1 1 0 1
If we substitute Y, Zt,Z*,Z** values from Table 1 into expressions (26), (27), (37), where A =0,01 (1/hour) and u = 1(1/hour), we obtain following values of considering reliability factors.
TE =23,769 hour.; R = 0,980645; TE =0,469 hour.
3 COMPARISON WITH KNOWN RESULTS
For checking of the obtained expressions we define values of considered system reliability factors (Figure 3) for which analytical estimations are known.
Figure 3: Parallel - series system
Following expressions may be found in Gnedenko&Belyayev&Solovyov (1965). For s identical elements connected in parallel with X and u parameters.
U =
1
^ * u
1+U) -1
X
u =
1
s * u
r =
tE+ + tE-
For k identical repairing subsystems connected in series.
T = 1 E+ k *(1/tE+)
R = rk 1 - R
te = te -
R
s
+
+
Substituting in this expressions ^ = 2, k = 3, A =0,1(1/hour), u =1 (1/hour) we obtain
TE+ = 20,000 hour.; R = 0,975411; TE = 0,504 hour.
For definition of these reliability factors using expressions (26), (27), (37), values of Yi and Zi should be defined.
It is obvious that y = 0 and zx = 0,0. In case of 2 elements failure there are three combinations resulting in disconnection (nonoperativity) of the system (1,2), (3,4), (5,6). The total number of probable combinations is 12, hence y2 =3 and z2 =3/12. For definition of Y3 and Z3 let us consider all probable combinations of 3 elements (Table 2). As followed from the results provided in Table 2, Y3 = 12 and Z 3 = 12/20. At failure of 4 or more elements, the system will be disconnected, therefore Z4 = Z5 = Z6 = 1.
Table 2. State of the system on Figure 3 at failure of 3 elements.
1 123 - 11 234 -
2 124 - 12 235 +
3 125 - 13 236 +
4 126 - 14 245 +
5 134 - 15 246 +
6 135 + 16 256 -
7 136 + 17 345 -
8 145 + 18 346 -
9 146 + 19 356 -
10 156 - 20 456 -
Table 3. Calculation results of system performance (for Figure 3)
i f n ^ 1i J P Yi Zi Z* y ** Z i Qt R
0 1 0,5644739301 0 0 0 0 0,0000000000 0,5644739301
1 6 0,3386843580 0 0 0,2 0 0,0000000000 0,3386843580
2 15 0,0846710895 3 0,2 0,5 0 0,0169342179 0,0677368716
3 20 0,0112894786 12 0,6 1 0,33333 0,0067736872 0,0045157914
4 15 0,0008467109 15 1 0 0,6 0,0008467109 0,0000000000
5 6 0,0000338684 6 1 0 1 0,0000338684 0,0000000000
6 1 0,0000005645 1 1 0 1 0,0000005645 0,0000000000
The rest results are presented in Table 3. Substituting the data from Table 3 into expressions (26), (27), (37), we obtain:
TE+ = 20,000 hour.; R = 0,975411; TE = 0,504 hour.
4 EXAMPLES OF NETWORK RELIABILITY ANALYSIS
We will consider biconnected networks which node degrees satisfy to the inequality 2 < k < 3.
Let us denote the number of nodes by m and the number of edges by n. It is necessary to pay attention that for each pair of (m,n) there is a topology providing the maximum level of reliability. For definition of such topologies the algorithms offered in Artamonov (1999) have been used.
Example 1. Ring with one diagonal m=20, n=21
It is obvious that y = 0 and Zx = 0. Considering combinations of 2 edges one may see that they form a cut even when they belong to the same chain. In addition, any combination of 2 edges which belong to the same chain is a cut itself. Calculation results of values Y and Z, for the network (Figure4) are presented in Table 4a. Table 4b includes TE , TE and R values calculated for various
X. Value ¡u=1. Here and elsewhere the unit of measurement for X and p is 1/hour, for TE and TE -
hour.
+
Figure 4. Network with parameters m=20, n=21 Table 4a. Yi and Zi values for the network presented on Figure 4.
i 1 2 > 3
Yi 0 63 ( n \ I i J
Zi 0 0,3000 1
Table 4b. TE , TE , R valuesfor the network presented on Figure 4.
A T 1e+ T R
0,1 1,13 0,6983 0,617547
0,01 79,56 0,4997 0,993759
0,001 7928,59 0,4995 0,999937
Example 2. Network with parameters m=20, n=24
As presented in Artamonov (1999), to reach the maximum reliability it is necessary to find the minimal diameter network with 2(n-m) nodes of degree 3, and distribute nodes of degree 2 on the edges of this network equally. The network presented on Figure 5 may be found as a result.
Y values of this network may be defined by exhaustion of all possible states. Results of calculations are presented Table 5a and Table 5b.
Table 5a. Values Yi and Zi for the network on Figure 5.
i 1 2 3 4 5 > 6
Yi 0 12 328 4082 29960 f n 1 I * J
Zi 0 0,043478 0,162055 0,384252 0,704875 1
Table 5b. Values TE , TE , R for the network on Figure 5.
A T 1e+ T R
0,1 2,13 0,3334 0,864576
0,01 390,86 0,4845 0,998762
0,001 41415,64 0,4986 0,999988
5 USE OF OTHER OPERABILITY CRITERIA AND MODELS OF NETWORK RELIABILITY
The connectivity of all nodes of the network has been used as operability criterion so far. However the following question would be appropriate: whether loss of communication with one of nodes is a failure? If we consider the loss of connectivity with k nodes to be admissible, then the following network operability criterion may be used: the network is considered operational, if number of connected nodes > (m - k).
The obtained results may be also used in this case, but respective alterations in algorithm of definition of values Yi should be made. In Table 6 values TE and R at various values of k are
presented for the network on Figure 5.
Table 6. Values TE and R at various values of k, X=0,01, p=1 for the network on Figure 5.
k T 1e+ R
1 5780,49 0,999943591
2 9843,50 0,999967012
3 31455,53 0,999990157
It is possible to weaken restriction on equireliability of edges. It is possible to present each edge in the form of serial connection of certain quantity of equireliable elements. Thus fictitious nodes are entered into network structure. After such transformations values of Zi are defined. If one of elements making an edge fails, then this edge is removed from the network. Network connectivity is checked without considering of fictitious nodes.
It should be mentioned that obtained results may be used in case of failure of nodes when edges are absolutely reliable. Node failure may be modeled by removal of all edges emanating from this node.
This method of the analysis of networks reliability can be used also when those states are only considered as operable at which value of certain network parameters, for example, the network capacity will satisfy to preset values.
For high dimension networks the statistical estimation of values Zi can be defined using a Monte-Carlo method. The random combination of i fault elements is generated, and then operability of a network is checked. The ration of an amount of trials at which the network will appear nonoperable to the total number of trials will be statistical estimation Z,.
However it is necessary to consider that for reaching of high accuracy of statistical estimation Z, the amount of trials should be great enough (105-106).
6 CONCLUSION
Analytical expressions defining reliability factors of networks consisting of identical elements are obtained. These elements fail and repair independently from each other and have exponentially distributed time to failure ad time to repair. Moreover both nodes and edges of the network may be regarded as absolutely reliable. Fidelity of the obtained result is vindicated by calculation of reliability factors of redundant system for which analytical estimations are known.
REFERENCES
1. I.A. Ushakov (1969a). On an evaluation of an average time between failures and an average idle time for systems with a large number of states. "Reliability and quality assurance" magazine. No.3.
2. I.A. Ushakov (1969b). On an evaluation of an average stationary stay time of semi-Markov process in a subset of states. Proceedings of the Academy of Sciences of the USSR. "Engineering cybernetics" magazine. No.4.
3. B.V.Gnedenko, ed. (1983) Aspects of Mathematical Theory of Reliability. Authors: E. Barzilovich, Yu. Belyaev, V. Kashtanov, I. Kovalenko, A. Solovyev, I. Ushakov. Radio I Svyaz, Moscow.
4. B. V. Gnedenko, Y. K. Belyayev, A. D. Solovyov (1965). Mathematical methods in reliability theory, Nauka, Moscow.
5. O.A. Tkachev (2010). Reliability Analysis of Data Transmission Networks. - In: Proceeding of International Workshop '' Distributed Computer and Communication Networks'', Russia, Moscow, (DCCN-2010) pp.276-283.
6. O.A. Tkachev (1999). Network reliability analysis, in Computing systems and networks design theory. Moscow Aviation Institute, Russia, Moscow.
7. G. T. Artamonov (1999). Networks of maximum reliability, in Computing systems and networks design theory. Moscow Aviation Institute, Russia, Moscow.