On Exponential-Weibull Distribution Useful in Reliability
and Survival Analysis
Anakha K. K & Chacko V. M
Department of Statistics, St. Thomas' College (Autonomous), Thrissur Kerala-680 001, India Email: [email protected], [email protected]
Abstract
In this paper, mixture of Exponential and Weibull distributions is considered for modelling real lifetime data. The basic mathematical properties including moments, generating functions, order statistics etc are derived. We obtain the reliability of stress-strength model. The maximum likelihood method is performed to estimate the parameters and a simulation study is conducted to validate the maximum likelihood estimators. The model is fitted to a real data set.
Keywords: Failure rate function, Reliability, Survival Analysis, Stress-Strength
I. Introduction
Lifetime distributions have a significant role in Reliability theory and survival analysis. Exponential, Gamma, Weibull and log-Normal distributions are some of the distributions commonly used for modeling lifetime data. Exponential and Weibull distributions are more popular than Gamma and log-Normal because the survival function of Gamma and log-Normal distributions doesn't have a closed form.
Mixture distributions have been used widely in reliability and survival analysis studies recently. It has been getting great attention, since mixture models are more appropriate and multiple causes of failure can be simultaneously modeled. Due to high flexibility, survival mixture models are better choice to analyze the reliability or survival data in situations when the data are believed to be heterogeneous and a single parametric distribution may not be sufficient to analyze the data.
There are several mixture distributions and generalizations of existing distributions, available in literature. A study on Generalized Lindley distribution useful in reliability study is given by Nadarajah et.al (2011). Nadarajah and Gupta (2007) studied on Exponentiated Gamma distribution with application to drought data. Mustafa et.al (2016) proposed Weibull Generalized Exponential Distribution. Gupta and Kundu (2001) studied on Exponentiated Exponential family as an alternative to gamma and Weibull. A detailed study on Statistical Models and Methods for Lifetime Data can be seen in Lawless (2003). Chacko et. al (2018) proposed Weibull-Lindly Distribution for modeling a bathtub shaped failure rate data.
In section 2, the mixture of Exponential and Weibull distributions is considered. The failure rate or hazard rate function is given in section 3. Moments are given in section 4 and generating functions are given in section 5. Conditional moments are given in section 6 and Quantile function is given in section 7. Mean deviation is given in section 8 and distributions of order statistics are
given in section 9. Bonferroni and Lorenz Curves are given in section 10. Reliability in stress-strength model is given in section 11. Estimation of parameters using maximum likelihood estimation method is described in section 12. Simulation study and real data analysis are given in section 13 and 14 respectively. Conclusions are given in last section.
2. Exponential-Weibull distribution
Here we consider the mixture of two lifetime distributions, namely Exponential and Weibull distributions. The cumulative distribution function (cdf) of mixture of Exponential and Weibull distribution can be represented as
F(x) = 9Fe(x) + (1-9)Fw(x), where 9 = A/(l+A), A > 0, FE (x) = 1 — e~A, A > 0, x > 0, the cdf of Exponential distribution with
scale parameter A and F^ (x) = 1 — e , A > 0,, a > 0, x>0, the cdf of Weibull distribution with scale parameter A and shape parameter a. The mixture of Exponential and Weibull distributions can be written as,
1 Ae~A + e ~(Ax c
F (x) = 1--—--. (1)
1 + A
The corresponding probability density function(pdf) is
„ , AAe~A +ccAaxa-le-{Ac
f ( x) =-—-. (2)
1 + A
Let X be a random variable, then we say that X has a 'Exponential-Weibull distribution' (EW(a, A)) with scale parameter a and shape parameter A, if it has the pdf (2).
The reliability function of the EW(a, A) distribution is
— — + e c
f (x)=A^r±e—,
1 + A
and corresponding failure rate or hazard rate function is
AeA + cAcxc—1ec h( x) =-;-7TC-. (3)
AeA + e-(Ax c
The mode for the mixture model EW(a, A) can be found by solving the derivative of the (2)
, — A3e— c2A2cx2c-2e-{Axc + c(c — 1)Acxc—V(Ax)c
f' ( x) =-—-(-)--(4)
1 + A
By solving (4), we observe that the mixture model EW(a, A) is unimodal. Figure l shows the pdf of EW(a, A) for various choices of parameters.
~i-r
0.0 0.2 0.4 0.6 0.8
1.0
Figure 1: pdf f(a, A) of EW(a, A) for values of parameters f1(x) = f(0.2, 0.4), f2(x) = f(1, 2), f3(x) = f(5, 10) and f4(x) = f(10, 2)
3. Failure Rate Function The failure rate or hazard rate function of the mixture EW(a, A) distribution is given as follows:
h( x) =
A2e^ + aÂaxa-1e ^ a
A> 0,a > 0, x > 0.
fo-tx + e-(*x r
EW(a, A) distribution has increasing, decreasing, upside-down bathtub shape behaviors for its failure rate. When a = 1; h(x) = A, a constant, i.e. it has Exponential distribution with lack of
memory property. From (3) lim 0 h(x) =-, a constant and lim h(x) = 0 . Figure 2
A +1
shows the failure function of EW(a, A) distribution with various choice of parameters. These shapes of failure function show that EW(a, A) distribution fit in with both monotonic and nonmonotonic behaviors which are more likely to be come across when dealing with lifetime data.
« i=
Figure 2: Failure function of EW(a, A) distribution for various choice of parameters h1(x) = h(0.2, 0.6), h2(x) = h(1, 2), h3(x) = (2, 3) and h4(x) = (4, 2.5) for h(a, A)
x
0.0
0.2
0.4
0.6
0.8
1.0
4. Moments
The rth raw moment of the EW(a, A) distribution with pdf in (2) is given by,
œ
jur = E ( Xr ) = J xrf ( x)dx
œ I -Ax . „.ia,,a-\„
r I A e + rA x e
-A r )
= J x
0
1+ A
dx
Ar!+rl 1 + -
a
The first four raw moments are,
1 ^
A + rl 1 + -
2A + rl 1 +
(1 + A)Ar 2 ^
E ( X ) =
a
(1 + A)A
, E(X2) = ■
a
(1 + A)A
E (X4) =
24A + rl 1 +
4N V r
-, E(X3) =
4
6A + r|1 + —
^ V r
(1 + A)A
and
(1 + A)A4
The variances of EW(a, A) distribution is given by;
f
(1 + A)
Var ( X ) =
2A + rI 1 +
2
f
—
r
A + rl 1 +
r
(1 + A)2 AA
Central moments can be obtained using raw moments.
5. Generating Functions
Let X be a random variable with probability density function (2). Its moment generating function (mgf) is given by,
I AVa +cAcxc-1e-(Acc )
E(etx ) = J e'tx
1 + A
dx
A2
+ ■
1
œ f
z
(1 + A)(A -1 ) 1 + A n=0 Ann!
-r
1 + n
V rj
, a > 1, t < A.
The characteristic function (cf) of X is <(t ) = E (e'tx ), which is
^ AAe~A +aAaxa-1e-(Ax
E (e'tx ) = J e'tx
A2
1 + A 1
dx
M
(1 + A)(A - it) 1+ AZ Ann!
j
f
■ +
—z
1 + A „=n
r
n
1 +
V r
0
r
2
1
0
0
The cumulant generating function (cgf) of X is given by, Kx(t) = log(t)
= log( T+l) +log
' A2 it rF|1+ -
_ +
A - it n=0
t0 A'n! I
a
6. Conditional Moments
The conditional expectation for the EW(a, A) distribution is given by,
x
E(Xn / X > x) =
A e+aAaxa-1e" 1+ A
dx
F ( X )
1 + A
7 , , rl - +1, (A)a
A2 r(n +1, Ax ) lg v '
Ae+ e-(Ax )a (1 + A)A'
+
(1 + A)An
Ax : a-(Ax)a
Ae-A + e
( , rl- +1, (A)a r(n +1, Ax) la V '
A'
n-1
An
In particular,
E(X / X > x) =
1
A 4- o-(Ax)a
AeA + e
rf - +1, {A)a
r(2, Ax )+^a
A
E ( X2 / X > x) =
+ e-(Ax T
r(3,Ax )|rfa + ^
A
A2
E(X3 / X > x) = ■
1
A 4_ o-(AxT
Ae-A + e
r(4, Ax^rfa+1W
A2
E(X4/ X > x) =
1
-A 4- o-(AxT
Ae-A + e
A3
4
and
rjA, rfT+(AT
A3 + A4
x
<
1
<
<
>
1
>
<
>
Anakha K K & Chacko V M RT&A, No 1 (56)
ON EXPONENTIAL-WEIBULL DISTRIBUTION Volume 15, March 2020
7. Quantile Function The pth quantile say Q(p), p e (0,1) is defined by Q(p) = p. Let X be a EW random variable with pdf
(2), then its quantile function is the root of the equation,
Ae—(p) + e-(Ag( )c _
1 + A = P
(1 — p )(1 + —) = — — p) + e-(AQ( P ) c
8. Mean Deviation
The average amount of scatter in a population from either the mean or the median is termed as mean deviation. The mean deviation about mean and, mean deviation about median are defined by,
ro
S (X ) = J|x — u\f (x)dx
0 and
ro
S2( X) = J|x—M | f (x)dx,
espectively, where ^ and M are mean and median respectively. S1 (X) and S2 (X) can be
r
calculated using the formulae,
ro
I — 2u + -
M
ro
= —u + :
S (X) = 2yF (m) — 2m + 2J xf (x)dx
M
ro
and S (X) = — M + 2 J xf (x)dx respectively.
M
The mean deviation about mean of the EW(a, A) distribution is,
( —Am . —(Au)c ^ ro ( .2 —Ax,„ c c—1 —(Ax)c A
, Ae M + e yM' „ „r|Ae +cA x e — - — 2u + 2J
Sj (X) = 2m
1 — -
1+ A
v
1 + A
M
dx
2 (a2 ue ~¥ +Me~{AuT) 2(Aye ~¥ + e
r
2r
v
-+1, AM)
c
_l(e -¥ —ye-{AmT)
A(1 + A) 1 + A A(1 + A)
f
2r
+ Vc
-+1, AM)c
1+ A A(1 + A)
0
where /(a,x) = Jta 1e *dt and r(a,x)=Jta 1e *dt are denotes the incomplete gamma
0 x
functions. The mean deviation about median of the EW(a, A) distribution is,
( fo e~fo + afo xa- e(fo )al
S2 (X) = -u + 2 J x
M
1+ fo
y
2
c
fo(1 + fo)
i
1 +1, (Mfo)a
A2 Me+ fo+ r
Va
9. Order Statistics
Let X1, X2,.....Xn be a random sample of EW(a, A) distribution with cdf and pdf as in (1) and (2)
respectively. Their corresponding order statistics is denoted by X(1) < X(2) < X(3)......< X(n) . The pdf
and cdf of the rth order statistic are,
n!
fx (r) (xrfo)
'(r - l)(n - r )•
n!
(r - 1)!(n - r)!
f (x;a,X)Fr 1 (x;afo)F (x;afo)
I n -r|(- 1);Fr-1+/(x;afo)f(x;afo)
1=0 V l y
and
n
Fx,) =I .FJ (x;afo){1 - F (x;afo)}n-J
V J y
n n- J I
n
Vn - j
v / y
(- 1)FJ+/ (x;afo),
=IIL
J=r /=0 VJ
The pdf of X(r) of EW(a, A) distribution is ,
nfae^ + afoxa-1e~(fo:^ ^ (-1) ' (r -1)(1 + fo) 1 /!(n - r - /)
afo), respectively for r = 1,2,3..n.
fx
1-
fo-A. + e~(Ax )a
1 + fo
and corresponding cdf is
Fx„(x;afo) = lHn n-J|(-1)/j!-
Vn-
J = r /=o
/=o VJ y
n - J
fo-fo + e-(fo T
J+/
1 +
r-1+/
The pdf of 1st order statistic is
fXh. =
^ (fo e-fo +afo xa-1e-{fo )a X(1) (1 + fo)V
(-1) 1 /!(n -1
^ j- ¿>-(foT
/!(n -1 - r)
1-
foe+ e
1 + fo
and the pdf of nth order statistic is
fx =t^ (foe-fo + afoaxa-V(fo )a )1 (n) (1 + fofy A
1 + fo-foTfo -e
,-fo _ 1o-(fox )T
The distribution of order statistics can be used for obtaining reliability of series or parallel system.
x
j = r
10. Bonferroni & Lorenz Curves Bonferroni and Lorenz curves are the popular tools for analyzing data emerging in Economics and Reliability. They are the fundamental tool for income analysis. The Bonferroni and Lorenz curves are defined by
and
respectively, or correspondingly by
and
i q
B(p) = — J xf (x)dx PM o
i q
L(P) = - f xf (x)dx
m 0
i p
B(p ) = — f Fx)dx
pm 0
i p
L(p) = - J F — (x)dx
M
respectively, where ^ = E(X) and q = F-1(p). For a random variable X with pdf (2), the Bonferroni and Lorenz curves are
rf1 +1, (Aq )c
b(P >=Mb)«1—(*+1)e"*)
+ ■
c
/1
and
l(P )=M(r+A)'(1 +1)e—)
rf1 +1, (Aq )c
Vc
+ —-
A
respectively.
11. Stress-Strength Reliability
Let X1 and X2 be two independent random variables, where X1 represents the strength and X2 represents the stress. Suppose X1 and X2 follows EW distribution, with parameters (a1, A1) and (a2, A2) respectively. Then the system reliability is
0
R = P[X> > X2] = J f (x% (x)dx
0
A e ~Ax xa1 -1e ~(Ax)
J
1+ A
x fV a e A + e )a2^ 1 —2
1+A
2
dx
(1 + A )(1 + ?2 ) 0
A(1 + A)- AA
(1 + A )(1 + ? )
2e? + a? x*1 -1e"(?1xa + A - Ae A - e)dx
^AA r(a2i +1) , +> „ +(1+A)+
(A, + A)
i=0
(A)"
> ^—'-—— ría i + aJ+>
/ ^ -i 2' * *
i=0 i! ?2 i=0
(-1)+1
A
v? y
r
—2 i + 1 Va1 J
ro
0
1
1
>
12. Estimation of Parameters
If X1, X2,....Xn is a random sample from EW(a, A), then its likelihood function is
n 1,2 —Axi —(Ax )c
L(x; c, A) = TT Ae +cAxie— .
It is familiar that maximum likelihood estimate of the parameters is the value of the parameter which maximizes the likelihood function. The partial derivatives of log L with respect to unknown parameters a and A are,
d logL(x;c, A) _ ^ Aaxc—1e~— c {c logc(c — 1) — cAaxc log(Ax,) +1} 5c A2e— + cAcxc—11e~— r
' , and
a log L(x.;a,A)_ £ Ae"Ai [(1 + A)(2 - x,. A)-?] + a(lxi )a-1 e "(Ax a [a(1 + A)(1 - (? )a)- A
~A (1 + A)A2e A + aAaxa-1e a)
By equating the above equations to zero we get two non-linear equations.
^ xa-1e-(Áx')a {alogaja - l)- aAaxa log(Ax; )+l} _ Q and
it A2 e + aAa xa-1e ~(Ax')a
AeA [(1 + ?X2 - x?)-?]+ a(Ax.)a-1 er [a(1 + ?)(1 -(Ax,)a)-A
0.
n
y ,
tí (1 + A\Ae~A + aAaxa-1e-{Äx-a
A solution of the non-linear equation gives the maximum likelihood estimates of a and A. The normal equations cannot be solved by analytically, so that Newton-Raphson's iteration method or any other numerical approximation methods are required. Since the maximum likelihood estimator (MLE)s cannot be expressed in explicit forms, we consider their asymptotic distribution
and confidence interval for a > 0 and A > 0. For large samples, the MLE I a, A | of (a, A) is
asymptotically normal with mean zero and variance covariance matrix I-1, where
I =
E
v f
E
d2 logL
da2 82 log L' dfoda
E
E
d2 logL dadfo 52 logL' dfo2
We can derive the approximate (1- 5)100% confidence interval of the parameters a and A. By using
A
variance covariance matrix, the confidence intervals are a± Z
var
a I and
var
fo± Z distribution.
fo | where Zn is the upper 100| — | percentile of the standard normal
■if J
13. Simulation Study
Here we performed a simulation study to validate the maximum likelihood estimation procedure for EW(a, A) distribution using Newton-Raphson method. For this purpose, we generated samples of sizes 25, 50, 100, 500, 1000 for different combinations of a and A. We computed the maximum likelihood estimates for each sample and repeated this process thousand times then computed the bias and mean square error(MSE)s of the parameter estimates.
The simulation is conducted for the selected values of a and A. Here we considered a = 1 , A = 1 ; a = 1 , A = 1.2 ; a = 2 , A = 1.5 and a = 3, A = 1.5 as initial parameter values. The Tables 1, 2, 3 and 4 gives the values of the estimates, bias and MSEs of the corresponding parameters. From the tables, it can be seen that, as sample size increases the bias and MSE of the estimates decreases.
Table 1: Estimates, Bias and MSE for a = 1 and A = 1
A
A
N Estimates Bias MSE
25 a = 1.293439 A = 1.038358 0.2934385 0.03835775 0.81239 0.05436978
50 a = 1.087168 A = 1.019775 0.08716756 0.0197749 0.0942621 0.0244342
100 a = 1.043303 A = 1.009724 0.08716756 0.0197749 0.0942621 0.0244342
500 a = 1.011771 A = 1.000085 0.01177124 8.461249e-05 0.005638945 0.002088029
1000 a = 1.006076 A = 1.000816 0.006076378 0.0008162859 0.002621145 0.001182026
Table 2 Estimates, Bias and MSE for a = 1 and A = 1.2
N Estimates Bias MSE
25 a = 1.275423 A = 1.250335 0.2754229 0.05033506 0.7921809 0.07960997
50 a = 1.123425 A = 1.227627 0.1234254 0.02762709 0.1598412 0.03496351
100 a = 1.054752 A = 1.203634 0.1234254 0.02762709 0.1598412 0.03496351
500 a = 1.008541 A = 1.198176 0.008540759 -0.001823803 0.00660528 0.003205042
1000 a = 1.006074 A = 1.202171 0.006073959 0.002171475 0.003285917 0.001605966
Table 3: Estimates, Bias and MSE for a =2 and A = 1.5
N Estimation Bias MSE
25 a = 2.737266 A = 1.53967 0.7372658 0.03966983 6.287694 0.07501019
50 a = 2.304472 A = 1.512797 0.3044722 0.01279734 1.261294 0.03309165
100 a = 2.122918 A = 1.508089 0.122918 0.008089073 0.3660359 0.01584789
500 a = 2.029113 A = 1.50079 0.0291133 0.0007895803 0.04937387 0.003182324
1000 a = 2.012721 A = 1.500226 0.01272087 0.0002263662 0.02528183 0.001405722
Table 4: Estimates, Bias and MSE for a = 3 and A = 1.5
N Estimates Bias MSE
25 a = 4.016253 A = 1.514296 1.016253 0.01429607 10.68158 0.04879727
50 a = 3.388975 A = 1.508105 0.3889748 0.008105423 1.99424 0.02228412
100 a = 3.177544 A = 1.504892 0.1775441 0.004891745 0.7521783 0.01055644
500 a = 3.019977 A = 1.498705 0.01997658 -0.00129502 0.09545959 0.00199238
1000 a = 3.007664 A = 1.500396 0.007664266 0.0003962086 0.04595611 0.001004574
14. Data Analysis
In this section, we considered a real data set of survival times (in days) of 72 guinea pigs infected with virulent tubercle bacilli, observed and reported by Bjerkedal (1960). This data set is used to explain the supremacy of the EW(a, A) distribution. The data are presented in Table 5:
Table 5: Data set of survival times of 72 guinea pigs infected with virulent tubercle bacilli
12 48 60 75 109 258 15 52 60 76 110 258 22 53 61 76 121 263 24 54 62 81 127 297 24 54 63 83 129 341 32 55 65 84 131 341 32 56 65 85 143 376 33 57 67 87 146 34 58 68 91 146 38 58 70 95 175 38 59 70 96 175 43 60 72 98 211 44 60 73 99 233
We fitted EW(a,A) distribution to the given data set and compared the results with Exponential distribution and Weibull distribution. The comparison is carried out based on the values of Kolmogorov-Smirnov (K-S) statistic, p-value, log-likelihood value, Akaike information criterion (AIC) and Bayesian information criterion (BIC).
The Kolmogorov-Smirnov test is defined by:
D = max | F Y ) - —, — - F (Y )|
V N N Vi/J
where F is the cdf of the distribution and N is the number of classes being ordered. The AIC and BIC are defined by
AIC = 2k - 2ln^ Lj and BIC = ln(n)k - 2 ln^ L j
A
where L denote the maximum value of the likelihood function and k is the number of parameters and n is the sample size. By comparing the values of K-S statistic, p-values, likelihood value, AIC and BIC, Table 6 shows that EW(a, A) distribution has smallest K-S statistic value and largest p-value among them. The AIC and BIC values of EW(a, A) distribution indicates that the amount of information lost by the model is less than that of Exponential and Weibull distributions. This points out that the proposed model provided a better fit as well as more precise estimates.
Table 6: Parameter Estimates, log-likelihood, p-value, AIC and BIC values of model fitted
MODEL PARAMETER ESTIMATES LOG LIKELIHOOD K-S STATISTIC P- VALUE AIC BIC
MEW a=1.396574502 b=0.009055284 -397.1651 0.1459 0.09327 798.3302 802.8836
WEIBULL a=1.393170797 b=0.009045585 -421.9635 0.14645 0.09114 847.927 852.4804
EXPONENTIAL b= 0.01001842 -422.1097 0.2116 0.003168 846.2193 848.496
15. Conclusions
In this paper, the mixture of Exponential and Weibull distributions are considered. The failure rate or hazard rate function is given. Moments, generating functions, Conditional moments, Quantile function, Mean deviation, distributions of order statistics, and Bonferroni and Lorenz Curves are obtained. Reliability in stress-streangth model is computed. Method of estimation of parameters using maximum likelihood estimation method is described and a simulation study is conducted to validate the maximum likelihood estimation procedure. A real data analysis is given, which shows the mixture distribution, EW(a, A) distribution is a better alternative in certain situations of reliability and survival analysis.
References
[1] Nadarajah, S., Bakouch, H.S., and R. Tahmasbi (2011). A generalized Lindley distribution. Technical Report, School of Mathematics, University of Manchester, UK
[2] Nadarajah, S., and A.K. Gupta (2007). The exponentiated gamma distribution with application to drought data. Calcutta Statistical Association Bulletin 59, 233-234.
[3] Abdelfattah Mustafa, Beih S. El-Desouky and Shamsan AL-Garash (2016). Weibull Generalized Exponential Distribution. Department of Mathematics, Faculty of Science, Mansoura University, Mansoura 35516, Egypt.
[4] Bjerkedal T (1960). Acquisition of resistance in guinea pigs infected with different doses of virulent tubercle bacilli. Am J Hyg 72, 130-148.
[5] Gupta, R. D. and Kundu, D. (2001). Exponentiated Exponential family; an alternative to gamma and Weibull. Biometrical Journal, 43, 117-130
[6] Lawless, J. F. (2003). Statistical Models and Methods for Lifetime Data. John Wiley and Sons, New York.
[7] Chacko V.M., Deepthi K S, Beenu Thomas, Rajitha C (2018) Weibull-Lindly Distribution: A bathtub shaped failure rate model, Reliability: Theory and Applications, Vol.13, pp.9-20, No.4(51).