Estimation procedures for a flexible extension of Maxwell distribution with data modeling
Abhimanyu Singh Yadav*1, H. S. Bakouch2, S. K. Singh3 and Umesh Singh4
•
department of Statistics, Banaras Hindu University, Varanasi, India.
E-mail: [email protected], [email protected] [email protected] 2Department of Mathematics, Faculty of Science, Tanta University, Tanta, Egypt. E-mail: [email protected] * Corresponding Author
Abstract
In this paper, we introduce a flexible extension of the Maxwell distribution for modeling various practical data with non-monotone failure rate. Some main properties of this distribution are obtained, and then the estimation of the parameters for the proposed distribution has been addressed by maximum likelihood estimation method and Bayes estimation method. The Bayes estimators have been obtained under gamma prior using squared error loss function. Also, a simulation study is gained to assess the estimates performance. A real-life applications for the proposed distribution have been illustrated through different lifetime data.
Keywords: Family of Maxwell distributions, Entropy, Classical and Bayes estimation, Interval estimation, Asymptotic confidence length.
1. Introduction
The Maxwell distribution has broad application in statistical physics, physical chemistry, and their related areas. Besides Physics and Chemistry it has also a good number of applications in reliability theory. At first, the Maxwell distribution was used as lifetime distribution by [1]. The inferences based on generalized Maxwell distribution have been discussed by [2]. [3] considered the estimation of reliability characteristics for Maxwell distribution under Bayes paradigm. [4] discussed the prior selection procedure in case of Maxwell distribution. [5] studied the distributions of the product |XY| and ratio |X/Y|, where X and Y are independent random variables having the Maxwell and Rayleigh distributions, respectively. [6] proposed the Bayesian estimation of the Maxwell parameters. [7] discussed the estimation procedure for the Maxwell parameters under progressive type-I hybrid censored data. Furthermore, several generalizations based on Maxwell distribution are advocated and statistically justified. Recently, two more extensions of Maxwell distribution has been introduced by [8], [9] and discussed the classical as well as Bayesian estimation of the parameter along with real-life applications.
A random variable Z follows the Maxwell distribution (MaD) with scale parameter a, denoted as Z ~ MaD (a), if its probability density function (PDF) and cumulative distribution function (CDF) are given by
4 3 2
f (z, a) = a2z2e-az z > 0, a > 0 (1)
and
F(z, a) = -2= r( 2, az2), (2)
respectively, where T(a, z) = /0 pa-1e-pdp is the incomplete gamma function.
In this article, we propose a flexible extension of the Maxwell distribution. The objective of this article is to get some main properties of this distribution for showing its merit in modeling various practical data, and then estimate the unknown parameters using classical and Bayes estimation methods. Other motivations regarding the advantages of the distribution comes from its flexibility to model the data with non-monotone failure rates. The former aim is justified, where the proposed distribution provides better fit to the reliability/survival data comparing to the some known and recent versions of the Maxwell distribution. Further, the distribution is that having the nature of platykurtic, mesokurtic and leptokurtic, hence it can be used to model skewed and symmetric data as well. Also, the Bayes procedure under informative prior provides the more efficient estimates as compared to the maximum likelihood estimates (MLEs) concerning the estimation point of view. Another motivation for the confidence interval of the distribution parameters is that increasing the sample size decreases the width of confidence intervals, because it decreases the standard error, and this justified by simulation study and using sizes of four practical data sets.
The reminder of the considered work has been structured in the following manner. Section 2 provides some statistical properties related to the proposed model for purpose of data modeling. In Section 3, some types of entropy are investigated. The maximum likelihood (ML) and Bayes estimation procedures have been discussed in Section 4. Also, a simulation study is carried out to compare the performance of Bayes estimates with MLEs. In Section 5, we illustrate the application and usefulness of the proposed model by applying it to four practical data sets. Section 5 offers some concluding remarks.
2. The model and some of its properties
This section provides another generalization of the MaD using power transformation of Maxwell random variates for estimations issues of the distribution parameters and modeling practical
1
data. For this purpose, consider the transformation X = Z p, where Z ~ MaD (a), hence the resulting distribution of X is called as power Maxwell distribution (for short PMaD) and denoted by X ~ PMaD (a, p), where, a and p are the scale and shape parameters, respectively. The PDF and CDF of the PMaD are given by
f (x, a, p) = a3px3p-1e-ax2p, x > 0, a, p > 0, (3)
V n
F(x, a, p) = -2= r(2, ax2p) , (4)
respectively. Plots of the PDF are given by Figure 1 for different choices of a and p. The plots show different kurtosis, positive skewness and symmetric shapes.
Some main mathematical and statistical properties of PMaD have been obtained in the following.
2.1. Behaviour with some reliability functions
This subsection, described the asymptotic nature of density and survival functions for the proposed distribution. To illustrate asymptotic behaviour, at first, we will show that lim f (x, a, ß) = 0
and lim f (x, a, ß) = 0 . Therefore, using (2.1)
lim f (x, a,ß) = ^a3ß lim x3ß-1 e-ax2ß = 0,
x^0 s/n x^0
Density plot
Hazard function plot
2.0 -
1.5 -
X 1.0-
0.5 -
0.0 -
0.5
I
1.0
— a=0.75 , p = 0.75
— a= 1 , p = 0.75 a=0.75 , p = 1.5 a= 1.5 , p = 1.5 a=2,p=2
I
1.5
I
2.0
1.5 -
1.0 -
>x
I
0.5 -
0.0 -
x
x
Figure 1: Density function and hazard function plot for different choices of a and f>.
and
lim f (x, a, ß) = —=a2ß lim x3ß-1 lim e-ax2ß = 0
Wn x^m x^m
The characteristics based on reliability function and hazard function are very useful to study the pattern of any lifetime phenomenon. Let X be a random variable with PDF (2.1) and CDF (2.2), different reliability measures for the proposed distribution are obtained by following equations.
The reliability function R( x) is given by
R(x) = P(X > x) = 1 - —= r (3,ax2ß^j
(5)
The mean time to system failure M( x) is
M(x) = E(x) = 1) ' r( ^
The hazard function H( x) is given as
H(x)
f(x, a, ß) _ 4a2ßx3ß-1 e-ax2ß
1 - F(x, a,ß) - 2f ax2ß)
(6)
(7)
The plots, in Figure 1, show that the proposed density is unimodel and positively skewed with monotone failure rate function for the different combination of the model parameters. The comparative behavior of the random variables can be measured by stochastic ordering concept
that is summarized in the next proposition.
Proposition: Let X ~ PMaD(a1, 01) and Y ~ PMaD(a2, 02), then the likelihood ratio is
O
fx(x) = ((01] x3(pi-^2)e-(«ix201 +«2x202).
fY (x)
a2
Therefore,
O
log f fx^^ = I
H fY (X)J X
02
3(01 - 02) - («1x201 + a2x202)
If pi = p2 = p, then O < 0, which implies that the random variable X is a likelihood ratio order than Y, that is X </r Y. Also, if a1 = a2 = a and p1 < p2, then again O < 0, which shows that X <ir Y. Other stochastic orderings behaviour follow using X </r Y, such as hazard rate order (X <hr Y), mean residual life order (X <mr/ Y) and scholastically greater (X <st Y).
2.2. Moments and some conditional ones
Let x1, x2, ■ ■ ■ xn be random observations from the PMaD(a, p). The rth moment, \lr, about origin is
r .
ur = I xrf(x, a, p) dx = ( 1 | p r ( 3p + r Jx=0 Jy HJ -n\a) V 2p
The coefficient of skewness and kurtosis measure the convexity of the curve and its shape. Using the moments above, the two earlier measures are obtained by moments based relations suggested by Pearson and given by
r > 1.
01
v3 - 3v2 v1 +2 (vi)
v2 - (v1)2
and
02
V4 - 4V3+ 6V2 - 3
v2- (v1)
Numerical values of some measures above are calculated in Table 1 for different combination of the model parameters, and it is observed that the shape of the PMaD is right skewed and almost symmetrical for some choices of a, p. Also, it can has the nature of platykurtic, mesokurtic and leptokurtic, thus PMaD may be used to model skewed and symmetric data as well.
The mode (M0) for PMaD (a, p) is obtained by solving the following expression -—f (x, a, p) |M0
dx
0, which yields
M0 = (32ipi) * •
Moreover, the median ( Md) of the proposed distribution can be calculated by using the empirical relation among the mean, median and mode. Thus, the median is,
1 2 / 1
Md = g Mo + 3 = 3
30 - 1N 20 4 2a0 ) + Tn
30 + 1 20
The moment generating function (mgf) MX(t) for a PMaD random variable X is obtained as
2 M 1 / t Mx (t) = E(etx ) = ^ j 02?
r 30 + r
2
3
2
4
2
1
r
a
Table 1: Values of mean, variance, skewness, kurtosis, mode and coefficient of variation for different a, f
a, i ft ft il f 2 x0 CV
when a fixed and f varying
0.5, 0.5 3.0008 5.9992 2.6675 7.0010 1.0000 0.8162
0.5, 1.0 1.5962 0.4530 0.2384 3.1071 1.4142 0.4217
0.5, 1.5 1.3376 0.1499 0.0102 2.7882 1.3264 0.2894
0.5, 2.5 1.1780 0.0445 0.0481 2.7890 1.2106 0.1792
0.5, 3.5 1.1204 0.0211 0.1037 2.4351 1.1533 0.1298
when i fixed a varying
0.5, 0.75 1.9392 1.1443 0.7425 3.8789 1.4057 0.5516
1.0, 0.75 1.2216 0.4541 0.7425 3.8789 0.8855 0.5516
1.5, 0.75 0.9323 0.2645 0.7425 3.8789 0.6758 0.5516
2.5, 0.75 0.6632 0.1338 0.7425 3.8789 0.4807 0.5516
3.5, 0.75 0.5299 0.0855 0.7425 3.8789 0.3841 0.5516
when both varying
1, 1 1.1287 0.2265 0.2384 3.1071 1.0000 0.4217
2, 2 0.8723 0.0372 0.0102 2.7895 0.8891 0.2212
3, 3 0.8484 0.0163 0.0831 2.6907 0.8736 0.1506
4,4 0.8509 0.0094 0.1069 1.9643 0.8750 0.1140
5, 5 0.8586 0.0062 0.0677 0.1072 0.8805 0.0915
For lifetime distributions, the conditional moments are of interest in prediction. Another application of conditional moments is the mean residual life (MRL). For this purpose, let X observed from PDF(2.1), the conditional moments, E(Xr |X > k) and the conditional mgf E(etx|X > k) are obtained as follows;
E(Xr|X > k)
L>kxrf a f)dx L>kf (x, a i)dx
2(i)2i r
and
Jxlv^1A_ L>ketxf (x, a,i)dx
E(e |X > k)
L>kf (x a i)dx
/ \ JL t_ f l\ 2i
^i=0 i!
3f + r 2f
, ak2i
n - 2r (3, ak2i)
2 " r
3f + r 2f
, ak2i
n — 2r (§, a.k2P) '
respectively. The MRL is the expected remaining life X — x, given that the equipment has survived to time k. The MRL function in terms of the first conditional moments is given as
2(±r r
m(x) = E[X - x|X > k]
3f + 1 2f
, ak2i
n - 2r (3, ak2i)
- x
3. Entropy measurements
In information theory, entropy measurement plays a vital role to study the uncertainty associated with the random variable. In this section, we discuss the different entropy measures for PMaD. For more detail about entropy measurement, see [10].
3.1. Renyi entropy
Renyi entropy of a r.v. X with PDF (2.1) is given as
Re
(1-i)
ln
4 3
I 1 "r=a2
-1e-xx2P I dx
Hence, after some algebra, we get
D 1 N, „ A, ,, a 1 -A - 20. 3A0 - A + 1 ^30A - A + 1
Re = (I-i) Aln4 - 2ln n + Aln0 lna -ln A + H 20-
1
3.2. A-entropy
The A entropy is also known as p entropy. The A entropy for a random variable X having PDF (2.1) is defined as
Ae
1
A- 1
1 - fA (x, a, 0)dx Jx=0
Using PDF (2.1) and after simplification, the expression for p-entropy is given by;
Ae
A- 1
1 -(-40a a
1 - A - 20 /rf 3A0 - A +
20
20
3A0 - A + 1
V A 20
(8)
3.3. Generalized entropy
The generalized entropy is defined by
G = va«-A ~ 1 . A = 0 . Ge = A(A - 1) 'A = 0,1,
where, vA = Jx°=0 xAf (x, a, 9)dx and « = E(X). After some algebra, we get
ge = . ^
30 + A ~2f~
r
30 + 1 20
-A
A(A - 1)
A = 0,1.
(9)
4. Parameter estimation with a simulation study
Here, we describe the maximum likelihood estimation method and Bayes estimation method for estimating the unknown parameters a, p of the PMaD. The estimators obtained under these methods are not in nice closed form; thus, numerical approximation techniques are used to get the solution. Further, the performances of these estimators are studied through a Monte Carlo simulation.
4.1. Maximum likelihood estimation
The most popular and efficient method of classical estimation of the parameter(s) is maximum
likelihood estimation. The estimators obtained by this method passes several desirable properties
CO
1
r
such as consistency, efficiency etc. Let Xi, X2, ■ ■ ■ , Xn be an iid random sample of size n taken from PMaD (a, ft), then the likelihood function is
L(a, 6) = ft -4=a2ftxf-1 e-ax? = aZ=i *? (n*^) ,
hence the corresponding log-likelihood function is written as
n 3 n n 2 n
ln L(a, 6) = l = n ln4 - - ln n + — ln a + n ln ft - a Z x-ft + (3ft - 1) Z ln xi. (10)
2 2 i=i i=i dl dl
The MLEs of a and ft are the solution of — = 0 and — = 0, hence
da oft
3n
n
3n - E x2P = 0 (11)
2a
i=1
n n 9« n
- — 2a £ xf ln xi + 3 £ ln xi = 0. (12)
P i=1 i=1
The MLEs of the parameters are obtained by solving the two equations above simultaneously, and non-linear maximization techniques is used to get the solution.
4.1.1 Uniqueness of MLEs
The uniqueness of the MLEs discussed in the previous section can be checked by using following propositions.
Proposition 1: If ft is fixed, then a exists and is unique.
3n 2P
Proof: Let La = ---£=1 x; P, since La is continuous and it has been verified that lim La = to
2a i a^o
and lim La = — £=-1 x2 < 0. This implies that La will have at least one root in interval (0, to)
a^TO 11 i 1
and hence La is a decreasing function in a. Thus, La = 0 has a unique solution in (0, to). Proposition 2: If a is fixed, then ft exists and is unique.
Proof: Let La = - - a En
p = P — a En=i x2p ln xi + 3 E"=i ln xi, since Lp is continuous and it has been verified that lim Lp = to and lim Lp = —2 E,"=i ln xi < 0. This implies, as above, ft exists and it
P P^to p i=1 r
is unique.
4.1.2 Fisher Information Matrix
Here, we derive the Fisher information matrix for constructing 100(1 — Y)% asymptotic confidence interval for the parameters using large sample theory. The Fisher information matrix can be obtained, by using equations (4.2) and (4.3), as
'laa lap
I (a, ft ) = —E I I (5.2.1)
where,
\lpa lpp/ (a,p)
3n n n n
- 2"2, laf> = -2 E xfln xi, lpp = - - 4a E xf (ln xi )2. 2a i=1 P i=1
respectively.
The above matrix can be inverted and the diagonal elements of I-1(a, p) provide the asymptotic variance of a and p, respectively. Now, two sided 100(1 — Y)% asymptotic confidence interval for a, p can be obtained as
a e [a — Z1—t \Jvar(a), a + Z1—t\Jvar(a)], p e [p — Z1—t jvar(p), p + Z1—t jvar(p)],
4.2. Bayes estimation
In this subsection, the Bayes estimation procedure for the PMaD parameters has been developed. Here, we consider two independent gamma priors for both shape and scale parameter. The considered prior is very flexible due to its flexibility of assuming different shape. Thus, the joint prior g(a, p) is given by;
g(a, p) a aa—1 pc—1 e—ba—dp ; a, p > 0, (13)
where a, b, c and d are the hyper-parameters of the considered priors. Using likelihood function of PMaD and equation above, the joint posterior density function n(a, p|x) is derived as
n(a, p\x)~ L(x|a,p)g(a,p)
Lip L(x\a p)g(a, p)da dp
a 3n+a—1 pn+c—1e—«(b+E,n=1x2p) e—dp rnn=1 x3p—1 ) (14)
L fpa3n+a—1 pn+c—1e—'T+E'=1 xOe—(nn=1 xfp—^ da dp
In the Bayesian analysis, the specification of proper loss function plays an important role. We talk most frequently used the square error loss function (SELF) to obtain the estimators of the parameters, which defined as
L($, $) a ($ — $)2, (15)
where $ is estimate of $. Bayes estimators under SELF is the posterior mean and evaluated by
$SELF = [E($\x)] , (16)
provided the expectation exist and finite. Thus, the Bayes estimators based on equation no. (4.5) under SELF are given by
kbs = Eap\x(a\p,x) = n—11 pa3?+apn+c—V«^1 ^e—dp (j^^ da dp, (17)
and
pbs = Epx(p\a,x) = n—11 fpa3n+a—1 pn+ce—a(b+En=1 x2p)e—dp (i^^ da dp, (18)
where n—1 = Lip a3?+a—1 pn+c—1e—alb+E>=1 x )e—dp (nn=1 x?p—^ da dp.
From equations (4.8) and (4.9), it is easy to observe that the posterior expectations are appearing in the form of the ratio of two integrals. Thus, the analytical solution of these expectations are not presumable. Therefore, any numerical approximation techniques may be implemented to
secure the solutions. Here, we used one of the most popular and quite effective approximation technique suggested by [11]. The detailed description is as follows.
fa f u(x, ft)ep(a,ftw dadfi (a,^s = fa fft eP(a,ft )+ dadft (19)
- 1
= (a, ft )ml + 2 [(Uaa + 2uxpx )tkk + (uaft + 2Uxpft )rKft + (Ufta + 2Uftpx )Tfta
a
+ (u ft ft + 2uft P ft )Tft ft] + -ft[(UaTaa + u ft Taft )(l111Taa + 2l21Taft + l12Tft ft) + (uaT fta + u ft Tft ft)(l21 Taa + 2l12 Tfta + l222 тftft)], (20)
where u(a, ft) = (a, ft), p(a, ft) = lng(a, ft) and l = ln L(a,ft|x),
d3l « n „ , „ dp dp
U = dOdftb' «,b = 0 ' ' a + b = 3, pa = ^ pft = dft
du Ua = da ' Uft = du dft ' uaa d2u = dO2, Uftft = d2 U dft2 ' ua = d2U dad ft '
Taa = 1 l20 Taft - 1 TU = Tfta, Tftft 1 l02
Since u(a, ft) is the function of a, ft,
• If u(a,ft ) = a in (4.11), then
ua = 1, u = 0, uaa = u = 0, ua = u a = 0.
• If u(a,ft )= ft in (4.11), then
uft = 1, ua = 0, uaa = u = 0, ua = u a = 0,
and the rest derivatives based on likelihood function are obtained as
3n „ v^ 26, , 2n „ v^ 20/, N3
l30 = -3, ln = -2 £ xf ln xi, lo3 = ft3 - 8a £ xf (ln Xi)3 a i=1 ft i=1
n
I12 = -4 £ xf (ln Xi)2 = l21. i=l
Using these derivatives the Bayes estimators of (a, ft) are obtained by expressions
1 1
a bl =a ml + 2 [(2uaPa )^aa + (2UaPft )Taft ] + 2 i(uaTaa )(ko?aa + 2hlTaft + ¡12?ft ft)
+ (uaTfta )(l21 Taa + 2l12 Tfta + l03 Tft ft)],
(21)
- - 1 1
ft bl = ft ml + 2 [(2uft Pa )rfta + (2uft Pft )Tft ft] + ^ [(uft Taft )(l30Taa + 2l21 Taft + l12Tft ft) ^
+ (uft Tft ft)(l21 Taa + 2l12 Tfta + l03 Tft ft)].
4.3. Simulation study
In this section, a Monte Carlo simulation study has been performed to assess the performance of the obtained estimators in terms of their mean square errors (MSEs). The MLEs of the parameters are evaluated by using nlm() function, and also the MLEs of reliability characteristics are obtained by using invariance properties. The Bayes estimates of the parameters are evaluated by Lindley's
approximation technique. The hyper-parameters values are chosen in such a way that the prior mean is equal to the true value, and prior variance is taken as very small, say 0.5. All the computations are done by R3.4.1 software. At first, we generated 5000 random samples from the PMaD (a, 0) using the Newton-Raphson algorithm for different variation of sample sizes as n = 10 (small), n = 20,30 (moderate), n = 50 (large) for fixed (a = 0.75,0 = 0.75) and secondly for different variation of (a,0) when sample size is fixed (n = 20), respectively. Average estimates and mean square error (MSE) of the parameters are calculated for the above mentioned choices, and the corresponding results are reported in Table 2. The asymptotic confidence interval (ACI) and asymptotic confidence length (ACL) are also obtained and presented in Table 3. From this simulation study, it has been observed that the precision of MLEs and Bayes estimator are increasing when the sample size is increasing while average ACL is decreasing. The Bayes estimates under informative prior is more precise as compared to the MLEs especially for small sample sizes while for large sample the precision of the estimators is almost same for all the considered parametric choices.
Table 2: Average estimates and mean square errors (in each second row) of the parameters and reliability characteristics based on simulated data.
n a , 0 a ml 0ml M(t)ml R(t)ml H(t)ml abl 0bl
10 0.5070 1.1598 1.5119 0.9691 0.1663 0.5063 1.1028
0.0631 0.2588 0.0164 0.0049 0.0947 0.0631 0.2027
20 0.6560 0.8848 1.4922 0.9343 0.2965 0.6521 0.8647
0.75,0.75 0.0098 0.0326 0.0093 0.0014 0.0703 0.0105 0.0263
30 0.7096 0.8064 1.4883 0.9163 0.3504 0.7058 0.7951
0.0022 0.0103 0.0071 0.0004 0.0010 0.0025 0.0087
50 0.7542 0.7453 1.4869 0.8988 0.3968 0.7514 0.7397
0.0003 0.0031 0.0046 0.0001 0.0003 0.0003 0.0031
for fixed n and different a, 0
0.5,0.75 0.6603 0.6832 1.7380 0.9044 0.3400 0.6574 0.6716
0.0261 0.0125 0.0585 0.0017 0.0099 0.0252 0.0117
0.5, 1.5 0.7290 0.3033 4.6222 0.7871 0.3556 0.7258 0.3229
20 0.0528 1.4330 11.9171 0.0402 0.1139 0.0513 1.3866
1.5, 0.5 0.5090 2.9297 1.1531 0.9983 0.0207 0.5517 2.8634
0.9907 6.6465 0.0242 0.1274 26.0695 0.9087 6.3006
2.5,2.5 1.0448 0.5958 1.4084 0.7953 0.6393 1.2825 0.6727
2.1402 3.6573 0.3860 0.0373 0.3553 1.5058 3.3715
Table 3: Interval estimates and asymptotic confidence length (ACL) of the parameters.
n a , 0 aL au ACLa 0L 0u ACl0
10 0.75,0.75 0.0874 0.9266 0.8393 0.5711 1.7485 1.1775
20 0.75,0.75 0.3209 0.9911 0.6703 0.5525 1.2171 0.6646
30 0.75,0.75 0.4263 0.9928 0.5665 0.5555 1.0574 0.5019
50 0.75,0.75 0.5290 0.9794 0.4505 0.5631 0.9275 0.3644
for fixed n and different a, 0
0.5, 0.75 0.3255 0.9951 0.6696 0.4142 0.9523 0.5381
m 0.5, 1.5 0.3794 1.0785 0.6991 0.4819 1.7425 1.2429
20 1.5, 0.5 0.4206 1.7812 0.76058 0.2260 1.8334 1.3807
2.5, 2.5 0.5804 2.9509 0.9788 0.54133 2.7783 1.1365
Figure 2: Empirical cumulative distribution function and QQ plot for the data set-I.
5. Practical data modeling
This section demonstrates the practical applicability of the proposed model in real-life scenario, especially for the survival/reliability data taken from different sources. The proposed distribution is compared with Maxwell distribution (MaD) and its different generalizations, such as, length biased Maxwell distribution (LBMaD), see [9], area biased Maxwell distribution (ABMaD), see [9], extended Maxwell distribution (EMaD), see [8] and generalized Maxwell distribution (GMaD), see [2]. For these models the estimates of the parameter(s) are obtained by method of maximum likelihood and the compatibility of PMaD has been discussed using model selection tools (which depend on the MLE) such as log-likelihood (-log L), Akaike information criterion (AIC), corrected Akaike information criterion (AICC), Bayesian information criterion (BIC) and Kolmogorov Smirnov (K-S) test. In general, the smaller values of these statistics indicate the better fit to the data.
The data sets description is as follows.
Data Set-I (Bladder cancer data): This data set represents the remission times (in months) of a 128 bladder cancer patients, and it was initially used by [12]. The same data set is used to show the superiority of extended Maxwell distribution by [8].
Data Set-II (Item failure data): This data set is taken from [13]. It shows 50 items put into use at initial time t = 0 and failure items recorded in weeks.
Data Set-III (Airborne communication transceiver): The data set was initially considered by [14]. It represent the 46 repair times (in hours) for an airborne communication transceiver.
Data Set-IV (Flood data). The data are the exceedances of flood peaks (in m3/s) of the Wheaton River near Carcross in Yukon Territory, Canada. The data consist of 72 exceedances for the years 1958-1984, rounded to one decimal place. This data set was analyzed by [16].
Summary of the considered data sets is given in Table 5 and it can be seen that skewness is positive for all data sets which indicates that they have positive skewness which appropriately suited to the proposed model. This table also shows platykurtic, mesokurtic and leptokurtic nature of the data, which proves again the suitability of the proposed model to the data.
Table 4: Goodness of fit values for different model.
Bladder cancer data N=128
Model k ß -logL AIC AICC BIC K-S
PMaD 0.7978 0.1637 366.3820 736.7639 732.8599 742.4680 0.3675
MaD 0.0076 - 1014.4440 2030.8870 2028.9190 2033.7400 0.4144
LBMaD 98.6386 - 669.3668 1340.7340 1338.7650 1343.5860 0.4906
ABMaD 78.9109 - 767.8122 1537.6240 1535.6560 1540.4770 0.5608
ExMaD 0.8447 1.4431 412.1232 828.2464 824.3424 833.9504 0.8265
GMaD 0.7484 527.2314 426.6019 857.2037 853.2997 862.9078 0.7086
Item failure data N=50
Model k ß -logL AIC AICC BIC K-S
PMaD 0.8339 0.1820 135.8204 275.6407 271.8961 279.4648 0.2625
MaD 0.0104 - 367.8528 737.7056 735.7890 739.6177 0.4268
LBMaD 72.1146 - 315.1624 632.3248 630.4081 634.2368 0.5112
ABMaD 57.6917 - 374.1247 750.2494 748.3328 752.1615 0.5825
ExMaD 0.6186 1.0139 151.2998 306.5996 302.8550 310.4237 0.7327
GMaD 0.5400 534.1569 151.2643 306.5287 302.7840 310.3527 0.3920
Airborne communication transceiver N=46
Model k ß -logL AIC AICC BIC K-S
PMaD 0.8735 0.2709 101.9125 207.8249 204.1040 211.4822 0.2136
MaD 0.0406 - 245.1383 492.2766 490.3675 494.1052 0.5027
LBMaD 18.4603 - 237.4945 476.9890 475.0799 478.8176 0.5771
ABMaD 14.7683 - 284.7017 571.4034 569.4943 573.2320 0.6324
ExMaD 0.7290 0.8672 103.3052 210.6104 206.8895 214.2677 0.2989
GMaD 0.6015 122.7666 110.8521 225.7042 221.9833 229.3615 0.4392
River data N=72
Model k ß -logL AIC AICC BIC K-S
PMaD 0.805185 0.1504145 212.8942 429.7884 425.9623 434.3418 0.2760
MaD 0.005032 - 610.9235 1223.847 1221.904 1226.124 0.3821
LBMaD 149.0315 - 426.3076 854.6153 852.6724 856.8919 0.4113
ABMaD 119.2252 - 493.3271 988.6543 986.7114 990.9309 0.4529
ExMaD 0.697471 1.306933 251.9244 507.8487 504.0226 512.4021 0.7487
GMaD 0.648149 919.7356 251.2767 506.5534 502.7273 511.1068 0.4998
Table 5: Summary of the data sets.
Data Min Q1 Q2 Mean Q3 Max Kurtosis Skewness
"1 0.080 3.348 6.395 9.366 11.838 79.050 18.483 3.287
II 0.013 1.390 5.320 7.821 10.043 48.105 9.408 2.306
III 0.200 0.800 1.750 3.607 4.375 24.500 11.803 2.888
IV 0.100 2.125 9.500 12.204 20.125 64.000 5.890 1.473
Table 6: ML and Bayes estimates of the four data sets.
Data a ml ßml abl ßbl
I 0.7978 0.1637 0.7962 0.1639
II 0.8339 0.1820 0.8292 0.1821
III 0.8735 0.2709 0.8675 0.2703
IV 0.8052 0.1504 0.8023 0.1506
Table 7: Interval estimates based on the four data sets.
Data aL au ACLa L u ACLß
I 0.6545 0.9411 0.2866 0.1373 0.1902 0.0529
II 0.5962 1.0717 0.4754 0.1376 0.2263 0.0888
III 0.6202 1.1269 0.5067 0.2081 0.3337 0.1256
IV 0.6126 0.9978 0.3852 0.1186 0.1822 0.0636
Emperical cumulative distribution plot for Data set-
Q-Q plot for Data Set-
Q t
E O
- Empirical CDF
---- CDF MLE
I
30
40
50
Q O
0.6 CDF
Figure 3: Empirical cumulative distribution function and QQ plot for the data set-ll.
Emperical cumulative distribution plot for Data set-
Q-Q plot for Data Set-
E O
I
10
Empirical CDF CDF MLE
I
15
I
20
—r
25
a o
y s
S 8* 1 *
J r i»
.. 1
0.2 0.4 0.6 0.8 1.0
CDF
Figure 4: Empirical cumulative distribution function and QQ plot for the data set-Ill.
x
x
Figure 5: Empirical cumulative distribution function and QQ plot for the data set-lV.
From Table 4, it is clear that the proposed model (PMaD) has least value of the model selection tools, which reflects the merit of PMaD for modeling such four practical data sets than the the existing versions of the Maxwell distributions. The empirical cumulative distribution function (ECDF) plots and corresponding QQ plots for all the considered data set are plotted for PMaD, see Figures 2-5. From ECDF and QQ plots, it is clear that the considered data sets are adequately fitted to the proposed model. The point (ML and Bayes) estimates of the parameters for each data set are reported in Table 6. The Bayes estimates are calculated under non-informative prior, and it is observed that the obtained estimates (ML and Bayes) are almost same. The interval estimate of the parameter and corresponding asymptotic confidence length are also evaluated and presented in Table 7. This table shows that as the size of the data increases, the length of the interval is decreases, because it decreases the standard error, which support to our simulation part.
6. Conclusion
This article proposed the power Maxwell distribution (PMaD) as a flexible extension of the Maxwell distribution and studied some of its main properties for data modeling. We also study the skewness and kurtosis of the PMaD and found that it is capable of modeling the positively skewed as well as symmetric data. The unknown parameters of the PMaD are estimated by the maximum likelihood estimation (MLE) and Bayes estimation methods. The MLEs of the reliability function and hazard function are also obtained by using the invariance property. The 95% asymptotic confidence interval for the parameters are constructed using Fisher information matrix. The MLEs and Bayes estimators are compared through the Monte Carlo simulation and observed that Bayes estimators are more precise under informative prior. Finally, medical/reliability data have been used to show practical utility of the PMaD, and it is observed that it provides the better fit comparing to other versions of the Maxwell distributions. Thus, it can be recommended as an alternative model for the non-monotone failure rate models.
Acknowledgements
Authors are very grateful to the editor and reviewers for their recommendation to publish this articel in this reputed journal. The first author greatly acknowledges Banaras Hindu University, India for providing financial support in form of Seed grant under the Institute of Eminence
Scheme (scheme no. Dev. 6031).
References
[1] Tyagi, R. K., and Bhattacharya, S. K. (1989). A note on the MVU estimation of reliability for the Maxwell failure distribution. Estadistica 41:73-79.
[2] Chaturvedi, A., and U. Rani. (1998). Classical and Bayesian reliability estimation of the generalized Maxwell failure distribution. Journal of Statistical Research 32:113-20.
[3] Bekker, A., and Roux, J. J. (2005). Reliability characteristics of the Maxwell distribution: A Bayes estimation study. Communications in Statistics - Theory and Methods 34:2169-78.
[4] Radha R. K. and Vekatesan P. (2013). On the double prior selection for the parameter of Maxwell distribution, International Journal of Scientific & Engineering Research, Volume 4, Issue 5.
[5] Shakil M., Golam B. M. K. and Chang K. C. (2008). Distributions of the product and ratio of Maxwell and Rayleigh random variables, Statistical Papers, 49:729-747.
[6] Dey, S., and Maiti, S. S. (2010). Bayesian estimation of the parameter of Maxwell distribution under different loss functions. Journal of Statistical Theory and Practice 4:279-287.
[7] Tomer, S. K., and Panwar, M. S. (2015). Estimation procedures for Maxwell distribution under type I progressive hybrid censoring scheme. Journal of Statistical Computation and Simulation 85:339-356.
[8] Sharma, V. K., Bakouch, H. S. and Khushboo Suthar, K. (2017a). An extended Maxwell distribution: Properties and applications, Communications in Statistics - Simulation and Computation, 46:9, 6982-7007.
[9] Sharma, V. K., Dey, S., Singh, K. S. and Manzoor, U. (2017b). On Length and Area biased Maxwell distributions, Communications in Statistics - Simulation and Computation, DOI: 10.1080/03610918.2017.1317804.
[10] Renyi A. (1961). On measures of entropy and information, in: Proceedings of the 4th Berkeley Symposium on Mathematical Statistics and Probability, University of California Press, Berkeley.
[11] Lindley, D. V., (1980). Approximate Bayes method, Trabajos de estadistica, Vol. 31, 223-237, 1980.
[12] Lee, E. T. and Wang, J. W. (2003). Statistical Methods for Survival Data Analysis. Wiley, New York, DOI:10.1002/0471458546.
[13] Murthy D. N. P., Xie M., Jiang R. (2004). Weibull model. Wiley, New York.
[14] Chhikara, R. S. and Folks, J. L. (1977). The inverse gaussian distribution as a lifetime model, Technometrics 19: 461-468.
[15] Bonferroni C. E. ( 1930), Elementi di Statistica General, Seeber, Firenze.
[16] Choulakian, V., Stephens, M. A. (2001). Goodness-of-fit tests for the generalized Pareto distribution, Technometrics 43(4), 478-484.