Estimation procedures for a flexible extension of Maxwell distribution with data modeling

Abhimanyu Singh Yadav*1, H. S. Bakouch2, S. K. Singh3 and Umesh Singh4

1Department of Statistics, Banaras Hindu University, Varanasi, India. E-mail: [email protected], [email protected], [email protected]

2Department of Mathematics, Faculty of Science, Tanta University, Tanta, Egypt. E-mail: [email protected]

* Corresponding Author

Abstract

In this paper, we introduce a flexible extension of the Maxwell distribution for modeling practical data with non-monotone failure rates. The main properties of this distribution are derived, and its parameters are estimated by the maximum likelihood and Bayes methods. The Bayes estimators are obtained under gamma priors using the squared error loss function. A simulation study is carried out to assess the performance of the estimates. Real-life applications of the proposed distribution are illustrated with several lifetime data sets.

Keywords: Family of Maxwell distributions, Entropy, Classical and Bayes estimation, Interval estimation, Asymptotic confidence length.

1. Introduction

The Maxwell distribution has broad application in statistical physics, physical chemistry, and related areas. Besides physics and chemistry, it also has a good number of applications in reliability theory. The Maxwell distribution was first used as a lifetime distribution by [1]. Inferences based on the generalized Maxwell distribution were discussed by [2]. [3] considered the estimation of reliability characteristics for the Maxwell distribution under the Bayes paradigm. [4] discussed the prior selection procedure in the case of the Maxwell distribution. [5] studied the distributions of the product |XY| and ratio |X/Y|, where X and Y are independent random variables having the Maxwell and Rayleigh distributions, respectively. [6] proposed the Bayesian estimation of the Maxwell parameters. [7] discussed the estimation procedure for the Maxwell parameters under progressive type-I hybrid censored data. Furthermore, several generalizations based on the Maxwell distribution have been advocated and statistically justified. Recently, two more extensions of the Maxwell distribution were introduced by [8] and [9], who discussed classical as well as Bayesian estimation of the parameters along with real-life applications.

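A random variable Z follows the Maxwell distribution (MaD) with scale parameter $\alpha$, denoted as $Z \sim MaD(\alpha)$, if its probability density function (PDF) and cumulative distribution function (CDF) are given by

$$f(z, \alpha) = \frac{4}{\sqrt{\pi}}\,\alpha^{3/2} z^{2} e^{-\alpha z^{2}}, \quad z > 0,\ \alpha > 0 \tag{1}$$

and

$$F(z, \alpha) = \frac{2}{\sqrt{\pi}}\,\Gamma\!\left(\frac{3}{2}, \alpha z^{2}\right), \tag{2}$$

respectively, where $\Gamma(a, z) = \int_{0}^{z} p^{a-1} e^{-p}\,dp$ is the (lower) incomplete gamma function.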

In this article, we propose a flexible extension of the Maxwell distribution. The objective is to derive the main properties of this distribution, showing its merit in modeling various practical data, and then to estimate the unknown parameters using classical and Bayes estimation methods. A further motivation is the flexibility of the distribution in modeling data with non-monotone failure rates. This aim is justified in that the proposed distribution provides a better fit to reliability/survival data than some known and recent versions of the Maxwell distribution. Further, the distribution can be platykurtic, mesokurtic or leptokurtic, so it can be used to model skewed as well as symmetric data. Also, the Bayes procedure under an informative prior provides more efficient estimates than the maximum likelihood estimates (MLEs). Another motivation concerns the confidence intervals of the distribution parameters: increasing the sample size decreases the width of the confidence intervals because it decreases the standard error, and this is confirmed by a simulation study and by four practical data sets of different sizes.

The remainder of the paper is structured as follows. Section 2 provides some statistical properties of the proposed model for the purpose of data modeling. In Section 3, some types of entropy are investigated. The maximum likelihood (ML) and Bayes estimation procedures are discussed in Section 4, where a simulation study is also carried out to compare the performance of the Bayes estimates with the MLEs. In Section 5, we illustrate the application and usefulness of the proposed model by applying it to four practical data sets. Section 6 offers some concluding remarks.

2. The model and some of its properties

This section provides another generalization of the MaD, obtained by a power transformation of Maxwell random variates, for parameter estimation and for modeling practical data. Consider the transformation $X = Z^{1/\beta}$, where $Z \sim MaD(\alpha)$. The resulting distribution of X is called the power Maxwell distribution (PMaD for short) and is denoted by $X \sim PMaD(\alpha, \beta)$, where $\alpha$ and $\beta$ are the scale and shape parameters, respectively. The PDF and CDF of the PMaD are given by

$$f(x, \alpha, \beta) = \frac{4}{\sqrt{\pi}}\,\alpha^{3/2} \beta\, x^{3\beta - 1} e^{-\alpha x^{2\beta}}, \quad x > 0,\ \alpha, \beta > 0, \tag{3}$$

and

$$F(x, \alpha, \beta) = \frac{2}{\sqrt{\pi}}\,\Gamma\!\left(\frac{3}{2}, \alpha x^{2\beta}\right), \tag{4}$$

respectively. Figure 1 shows the PDF for different choices of $\alpha$ and $\beta$. The plots exhibit different degrees of kurtosis, with positively skewed as well as nearly symmetric shapes.

The main mathematical and statistical properties of the PMaD are derived below.
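As a quick numerical illustration of (3) and (4), the following minimal Python sketch evaluates the PMaD density and distribution function. It assumes NumPy and SciPy are available; the function names `pmad_pdf` and `pmad_cdf` are hypothetical and are not taken from the paper. SciPy's `gammainc` is the regularized lower incomplete gamma function, so the factor $2/\sqrt{\pi} = 1/\Gamma(3/2)$ is already included.

```python
import numpy as np
from scipy import integrate, special


def pmad_pdf(x, alpha, beta):
    """Density of PMaD(alpha, beta), equation (3)."""
    return (4.0 / np.sqrt(np.pi)) * alpha**1.5 * beta \
        * np.power(x, 3.0 * beta - 1.0) * np.exp(-alpha * np.power(x, 2.0 * beta))


def pmad_cdf(x, alpha, beta):
    """CDF of PMaD(alpha, beta), equation (4)."""
    # gammainc(a, z) is regularized: it equals the lower incomplete gamma
    # integral divided by Gamma(a), and Gamma(3/2) = sqrt(pi) / 2.
    return special.gammainc(1.5, alpha * np.power(x, 2.0 * beta))


if __name__ == "__main__":
    alpha, beta = 0.75, 1.5  # one of the parameter pairs plotted in Figure 1
    total, _ = integrate.quad(pmad_pdf, 0.0, np.inf, args=(alpha, beta))
    area, _ = integrate.quad(pmad_pdf, 0.0, 1.2, args=(alpha, beta))
    print("integral of the density:", total)
    print("area up to 1.2 and CDF at 1.2:", area, pmad_cdf(1.2, alpha, beta))
```

The check under `__main__` integrates the density numerically and compares the partial area with the closed-form CDF, which is a simple way to confirm that the two formulas agree.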

2.1. Behaviour with some reliability functions

This subsection describes the asymptotic behaviour of the density and survival functions of the proposed distribution. To illustrate the asymptotic behaviour, we first show that $\lim_{x \to 0} f(x, \alpha, \beta) = 0$ and $\lim_{x \to \infty} f(x, \alpha, \beta) = 0$. Using (3), for $\beta > 1/3$,

$$\lim_{x \to 0} f(x, \alpha, \beta) = \frac{4}{\sqrt{\pi}}\,\alpha^{3/2} \beta \lim_{x \to 0} x^{3\beta - 1} e^{-\alpha x^{2\beta}} = 0,$$

and

$$\lim_{x \to \infty} f(x, \alpha, \beta) = \frac{4}{\sqrt{\pi}}\,\alpha^{3/2} \beta \lim_{x \to \infty} x^{3\beta - 1} e^{-\alpha x^{2\beta}} = 0.$$

For $\beta < 1/3$ the factor $x^{3\beta - 1}$ is unbounded near the origin, so the first limit is zero only when $\beta > 1/3$.

Figure 1: Density function and hazard function plots for different choices of $\alpha$ and $\beta$ (curves shown: $\alpha = 0.75, \beta = 0.75$; $\alpha = 1, \beta = 0.75$; $\alpha = 0.75, \beta = 1.5$; $\alpha = 1.5, \beta = 1.5$; $\alpha = 2, \beta = 2$).

The reliability function and the hazard function are very useful for studying the pattern of any lifetime phenomenon. Let X be a random variable with PDF (3) and CDF (4); the reliability measures of the proposed distribution are given by the following equations.

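The reliability function R(x) is given by

$$R(x) = P(X > x) = 1 - \frac{2}{\sqrt{\pi}}\,\Gamma\!\left(\frac{3}{2}, \alpha x^{2\beta}\right). \tag{5}$$

The mean time to system failure M(x) is

$$M(x) = E(X) = \frac{2}{\sqrt{\pi}} \left(\frac{1}{\alpha}\right)^{\frac{1}{2\beta}} \Gamma\!\left(\frac{3\beta + 1}{2\beta}\right). \tag{6}$$

The hazard function H(x) is given as

$$H(x) = \frac{f(x, \alpha, \beta)}{1 - F(x, \alpha, \beta)} = \frac{4 \alpha^{3/2} \beta\, x^{3\beta - 1} e^{-\alpha x^{2\beta}}}{\sqrt{\pi} - 2\,\Gamma\!\left(\frac{3}{2}, \alpha x^{2\beta}\right)}. \tag{7}$$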
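The reliability measures can be evaluated numerically as sketched below. This is only an illustration under stated assumptions: the function names are hypothetical, SciPy's regularized upper incomplete gamma function `gammaincc` is used for the survival probability, and the second parameter pair (with $1/3 < \beta < 1/2$) is chosen here only to show a hazard that first rises and then falls; it is not one of the paper's plotted cases.

```python
import numpy as np
from scipy import special


def pmad_reliability(x, alpha, beta):
    """R(x) = 1 - F(x), equation (5), via the regularized upper incomplete gamma."""
    return special.gammaincc(1.5, alpha * np.power(x, 2.0 * beta))


def pmad_mean(alpha, beta):
    """Mean time to system failure E(X), equation (6)."""
    return (2.0 / np.sqrt(np.pi)) * alpha ** (-1.0 / (2.0 * beta)) \
        * special.gamma((3.0 * beta + 1.0) / (2.0 * beta))


def pmad_hazard(x, alpha, beta):
    """H(x) = f(x) / R(x), equation (7)."""
    x = np.asarray(x, dtype=float)
    pdf = (4.0 / np.sqrt(np.pi)) * alpha**1.5 * beta \
        * x ** (3.0 * beta - 1.0) * np.exp(-alpha * x ** (2.0 * beta))
    return pdf / pmad_reliability(x, alpha, beta)


if __name__ == "__main__":
    grid = np.linspace(0.1, 2.5, 6)
    print("increasing hazard (alpha=0.75, beta=1.5):", pmad_hazard(grid, 0.75, 1.5))
    wide = np.linspace(0.05, 50.0, 400)
    h = pmad_hazard(wide, 1.0, 0.4)  # illustrative pair, not from the paper
    print("hazard peaks at x =", wide[np.argmax(h)])
```

Evaluating the hazard on a grid in this way is a convenient check of the failure-rate shape for any parameter pair of interest.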

The plots in Figure 1 show that the proposed density is unimodal and positively skewed, with a monotone failure rate function for the plotted combinations of the model parameters. The comparative behaviour of random variables can be measured by the concept of stochastic ordering, which is summarized in the next proposition.

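Proposition: Let $X \sim PMaD(\alpha_1, \beta_1)$ and $Y \sim PMaD(\alpha_2, \beta_2)$. Then the likelihood ratio is

$$\frac{f_X(x)}{f_Y(x)} = \left(\frac{\alpha_1}{\alpha_2}\right)^{3/2} \frac{\beta_1}{\beta_2}\, x^{3(\beta_1 - \beta_2)}\, e^{-\left(\alpha_1 x^{2\beta_1} - \alpha_2 x^{2\beta_2}\right)}.$$

Therefore,

$$\Phi = \frac{d}{dx} \log \frac{f_X(x)}{f_Y(x)} = \frac{1}{x} \left[ 3(\beta_1 - \beta_2) - 2\left(\alpha_1 \beta_1 x^{2\beta_1} - \alpha_2 \beta_2 x^{2\beta_2}\right) \right].$$

If $\beta_1 = \beta_2 = \beta$ and $\alpha_1 > \alpha_2$, then $\Phi < 0$ for all $x > 0$, which implies that X is smaller than Y in the likelihood ratio order, that is, $X \le_{lr} Y$. If $\alpha_1 = \alpha_2 = \alpha$ and $\beta_1 < \beta_2$, then $\Phi < 0$ on the range of x where $2\alpha\left(\beta_2 x^{2\beta_2} - \beta_1 x^{2\beta_1}\right) < 3(\beta_2 - \beta_1)$, so the same ordering holds on that range. Whenever $X \le_{lr} Y$ holds, the other stochastic orderings follow, such as the hazard rate order ($X \le_{hr} Y$), the mean residual life order ($X \le_{mrl} Y$) and the usual stochastic order ($X \le_{st} Y$).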

2.2. Moments and some conditional ones

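Let $x_1, x_2, \ldots, x_n$ be random observations from the $PMaD(\alpha, \beta)$. The rth moment about the origin, $\mu'_r$, is

$$\mu'_r = \int_{0}^{\infty} x^{r} f(x, \alpha, \beta)\,dx = \frac{2}{\sqrt{\pi}} \left(\frac{1}{\alpha}\right)^{\frac{r}{2\beta}} \Gamma\!\left(\frac{3\beta + r}{2\beta}\right), \quad r = 1, 2, \ldots$$

The coefficients of skewness and kurtosis describe the asymmetry and the peakedness of the density curve. Using the moments above, these two measures are obtained from the moment-based relations suggested by Pearson:

$$\beta_1 = \frac{\left[\mu'_3 - 3\mu'_2 \mu'_1 + 2(\mu'_1)^3\right]^2}{\left[\mu'_2 - (\mu'_1)^2\right]^3}$$

and

$$\beta_2 = \frac{\mu'_4 - 4\mu'_3 \mu'_1 + 6\mu'_2 (\mu'_1)^2 - 3(\mu'_1)^4}{\left[\mu'_2 - (\mu'_1)^2\right]^2}.$$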

Numerical values of these measures are reported in Table 1 for different combinations of the model parameters. The PMaD is right skewed, and almost symmetrical for some choices of $\alpha$ and $\beta$. It can also be platykurtic, mesokurtic or leptokurtic, so the PMaD may be used to model both skewed and symmetric data.

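The mode ($M_0$) of the $PMaD(\alpha, \beta)$ is obtained by solving $\frac{d}{dx} f(x, \alpha, \beta)\big|_{x = M_0} = 0$, which yields, for $\beta > 1/3$,

$$M_0 = \left(\frac{3\beta - 1}{2\alpha\beta}\right)^{\frac{1}{2\beta}}.$$

Moreover, the median ($M_d$) of the proposed distribution can be approximated by using the empirical relation among the mean, median and mode. Thus, the median is

$$M_d = \frac{1}{3} M_0 + \frac{2}{3} \mu'_1 = \frac{1}{3} \left(\frac{3\beta - 1}{2\alpha\beta}\right)^{\frac{1}{2\beta}} + \frac{4}{3\sqrt{\pi}} \left(\frac{1}{\alpha}\right)^{\frac{1}{2\beta}} \Gamma\!\left(\frac{3\beta + 1}{2\beta}\right).$$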

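The moment generating function (mgf) $M_X(t)$ of a PMaD random variable X is obtained as

$$M_X(t) = E(e^{tX}) = \frac{2}{\sqrt{\pi}} \sum_{r=0}^{\infty} \frac{t^{r}}{r!} \left(\frac{1}{\alpha}\right)^{\frac{r}{2\beta}} \Gamma\!\left(\frac{3\beta + r}{2\beta}\right),$$

provided the series converges.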

Table 1: Values of mean, variance, skewness, kurtosis, mode and coefficient of variation (CV) for different $\alpha$, $\beta$.

| $\alpha, \beta$ | Mean ($\mu'_1$) | Variance ($\mu_2$) | Skewness ($\beta_1$) | Kurtosis ($\beta_2$) | Mode ($x_0$) | CV |
|---|---|---|---|---|---|---|
| $\alpha$ fixed, $\beta$ varying | | | | | | |
| 0.5, 0.5 | 3.0008 | 5.9992 | 2.6675 | 7.0010 | 1.0000 | 0.8162 |
| 0.5, 1.0 | 1.5962 | 0.4530 | 0.2384 | 3.1071 | 1.4142 | 0.4217 |
| 0.5, 1.5 | 1.3376 | 0.1499 | 0.0102 | 2.7882 | 1.3264 | 0.2894 |
| 0.5, 2.5 | 1.1780 | 0.0445 | 0.0481 | 2.7890 | 1.2106 | 0.1792 |
| 0.5, 3.5 | 1.1204 | 0.0211 | 0.1037 | 2.4351 | 1.1533 | 0.1298 |
| $\beta$ fixed, $\alpha$ varying | | | | | | |
| 0.5, 0.75 | 1.9392 | 1.1443 | 0.7425 | 3.8789 | 1.4057 | 0.5516 |
| 1.0, 0.75 | 1.2216 | 0.4541 | 0.7425 | 3.8789 | 0.8855 | 0.5516 |
| 1.5, 0.75 | 0.9323 | 0.2645 | 0.7425 | 3.8789 | 0.6758 | 0.5516 |
| 2.5, 0.75 | 0.6632 | 0.1338 | 0.7425 | 3.8789 | 0.4807 | 0.5516 |
| 3.5, 0.75 | 0.5299 | 0.0855 | 0.7425 | 3.8789 | 0.3841 | 0.5516 |
| both varying | | | | | | |
| 1, 1 | 1.1287 | 0.2265 | 0.2384 | 3.1071 | 1.0000 | 0.4217 |
| 2, 2 | 0.8723 | 0.0372 | 0.0102 | 2.7895 | 0.8891 | 0.2212 |
| 3, 3 | 0.8484 | 0.0163 | 0.0831 | 2.6907 | 0.8736 | 0.1506 |
| 4, 4 | 0.8509 | 0.0094 | 0.1069 | 1.9643 | 0.8750 | 0.1140 |
| 5, 5 | 0.8586 | 0.0062 | 0.0677 | 0.1072 | 0.8805 | 0.0915 |
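The summary measures reported in Table 1 can be recomputed from the closed-form moments. The sketch below is a hedged illustration, not the authors' code: the helper names are hypothetical, Pearson's $\beta_1$ is taken as the squared third central moment over the cubed variance, and the mode is set to 0 when $\beta \le 1/3$ (an assumption made here, since the density is then non-increasing from the origin).

```python
import numpy as np
from scipy import special


def raw_moment(r, alpha, beta):
    """r-th moment about the origin of PMaD(alpha, beta)."""
    return (2.0 / np.sqrt(np.pi)) * alpha ** (-r / (2.0 * beta)) \
        * special.gamma((3.0 * beta + r) / (2.0 * beta))


def pmad_summary(alpha, beta):
    """Mean, variance, Pearson beta1 and beta2, mode and CV."""
    m1, m2, m3, m4 = (raw_moment(r, alpha, beta) for r in (1, 2, 3, 4))
    var = m2 - m1**2
    mu3 = m3 - 3.0 * m2 * m1 + 2.0 * m1**3                    # third central moment
    mu4 = m4 - 4.0 * m3 * m1 + 6.0 * m2 * m1**2 - 3.0 * m1**4  # fourth central moment
    if beta > 1.0 / 3.0:
        mode = ((3.0 * beta - 1.0) / (2.0 * alpha * beta)) ** (1.0 / (2.0 * beta))
    else:
        mode = 0.0  # assumption: density is non-increasing from the origin
    return {
        "mean": m1,
        "variance": var,
        "beta1": mu3**2 / var**3,
        "beta2": mu4 / var**2,
        "mode": mode,
        "cv": np.sqrt(var) / m1,
    }


if __name__ == "__main__":
    for pair in [(0.5, 0.5), (0.5, 1.0), (1.0, 1.0)]:
        print(pair, pmad_summary(*pair))
```

For $\alpha = 0.5$, $\beta = 0.5$ the transformed variable $X = Z^2$ is chi-square with 3 degrees of freedom, so the exact values (mean 3, variance 6, $\beta_1 = 8/3$, $\beta_2 = 7$) give a convenient reference point; the corresponding row of Table 1 agrees with them to about three decimals.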

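For lifetime distributions, the conditional moments are of interest in prediction. Another application of conditional moments is the mean residual life (MRL). For this purpose, let X have PDF (3). The conditional moments $E(X^r \mid X > k)$ and the conditional mgf $E(e^{tX} \mid X > k)$ are obtained as

$$E(X^r \mid X > k) = \frac{\int_{x > k} x^{r} f(x, \alpha, \beta)\,dx}{\int_{x > k} f(x, \alpha, \beta)\,dx} = \frac{2 \left(\frac{1}{\alpha}\right)^{\frac{r}{2\beta}} \Gamma_u\!\left(\frac{3\beta + r}{2\beta}, \alpha k^{2\beta}\right)}{\sqrt{\pi} - 2\,\Gamma\!\left(\frac{3}{2}, \alpha k^{2\beta}\right)}$$

and

$$E(e^{tX} \mid X > k) = \frac{\int_{x > k} e^{tx} f(x, \alpha, \beta)\,dx}{\int_{x > k} f(x, \alpha, \beta)\,dx} = \sum_{i=0}^{\infty} \frac{t^{i}}{i!}\, \frac{2 \left(\frac{1}{\alpha}\right)^{\frac{i}{2\beta}} \Gamma_u\!\left(\frac{3\beta + i}{2\beta}, \alpha k^{2\beta}\right)}{\sqrt{\pi} - 2\,\Gamma\!\left(\frac{3}{2}, \alpha k^{2\beta}\right)},$$

respectively, where $\Gamma_u(a, z) = \int_{z}^{\infty} p^{a-1} e^{-p}\,dp$ denotes the upper incomplete gamma function and $\Gamma(a, z)$ is the lower incomplete gamma function used in (2). The MRL is the expected remaining life $X - k$, given that the equipment has survived to time k. In terms of the first conditional moment, the MRL function is

$$m(k) = E[X - k \mid X > k] = \frac{2 \left(\frac{1}{\alpha}\right)^{\frac{1}{2\beta}} \Gamma_u\!\left(\frac{3\beta + 1}{2\beta}, \alpha k^{2\beta}\right)}{\sqrt{\pi} - 2\,\Gamma\!\left(\frac{3}{2}, \alpha k^{2\beta}\right)} - k.$$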
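The conditional moments and the MRL can be checked numerically. The sketch below assumes that the numerator involves the upper incomplete gamma function (the integral from $\alpha k^{2\beta}$ to infinity), computes it with SciPy's regularized `gammaincc`, and compares the closed form with direct quadrature. All function names are hypothetical.

```python
import numpy as np
from scipy import integrate, special


def pmad_pdf(x, alpha, beta):
    """Density of PMaD(alpha, beta), equation (3)."""
    return (4.0 / np.sqrt(np.pi)) * alpha**1.5 * beta \
        * np.power(x, 3.0 * beta - 1.0) * np.exp(-alpha * np.power(x, 2.0 * beta))


def conditional_moment(r, k, alpha, beta):
    """E(X^r | X > k) in closed form via incomplete gamma functions."""
    a = (3.0 * beta + r) / (2.0 * beta)
    z = alpha * k ** (2.0 * beta)
    upper = special.gammaincc(a, z) * special.gamma(a)  # unregularized upper incomplete gamma
    survival = special.gammaincc(1.5, z)                 # P(X > k)
    return (2.0 / np.sqrt(np.pi)) * alpha ** (-r / (2.0 * beta)) * upper / survival


def mean_residual_life(k, alpha, beta):
    """m(k) = E(X - k | X > k)."""
    return conditional_moment(1, k, alpha, beta) - k


def mean_residual_life_quad(k, alpha, beta):
    """Brute-force check: integrate (x - k) f(x) over x > k, divide by P(X > k)."""
    num, _ = integrate.quad(lambda x: (x - k) * pmad_pdf(x, alpha, beta), k, np.inf)
    den, _ = integrate.quad(pmad_pdf, k, np.inf, args=(alpha, beta))
    return num / den


if __name__ == "__main__":
    for k in (0.0, 0.5, 1.0, 2.0):
        print(k, mean_residual_life(k, 0.75, 0.75), mean_residual_life_quad(k, 0.75, 0.75))
```

At k = 0 the MRL reduces to the unconditional mean in (6), which gives a second consistency check.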

3. Entropy measurements

In information theory, entropy plays a vital role in studying the uncertainty associated with a random variable. In this section, we discuss different entropy measures for the PMaD. For more details about entropy measurement, see [10].

3.1. Renyi entropy

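The Renyi entropy of a random variable X with PDF (3) is given as

$$R_E = \frac{1}{1 - \lambda} \ln \int_{0}^{\infty} \left( \frac{4}{\sqrt{\pi}}\,\alpha^{3/2} \beta\, x^{3\beta - 1} e^{-\alpha x^{2\beta}} \right)^{\lambda} dx, \quad \lambda > 0,\ \lambda \neq 1.$$

Hence, after some algebra, we get

$$R_E = \frac{1}{1 - \lambda} \left[ \lambda \ln 4 - \frac{\lambda}{2} \ln \pi - \ln 2 + (\lambda - 1) \ln \beta + \frac{\lambda - 1}{2\beta} \ln \alpha - \frac{3\beta\lambda - \lambda + 1}{2\beta} \ln \lambda + \ln \Gamma\!\left(\frac{3\beta\lambda - \lambda + 1}{2\beta}\right) \right].$$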
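The algebra behind the closed form can be sketched as follows. The step relies on the substitution $u = \lambda\alpha x^{2\beta}$, which is an assumption about how the integral is evaluated (the text does not spell it out), and it requires $3\beta\lambda - \lambda + 1 > 0$ for the gamma function to be defined:

$$
\begin{aligned}
\int_{0}^{\infty} f^{\lambda}(x, \alpha, \beta)\,dx
 &= \left(\frac{4}{\sqrt{\pi}}\right)^{\lambda} \alpha^{3\lambda/2} \beta^{\lambda}
    \int_{0}^{\infty} x^{\lambda(3\beta - 1)} e^{-\lambda\alpha x^{2\beta}}\,dx \\
 &= \left(\frac{4}{\sqrt{\pi}}\right)^{\lambda} \alpha^{3\lambda/2} \beta^{\lambda}\,
    \frac{1}{2\beta}\,(\lambda\alpha)^{-\frac{3\beta\lambda - \lambda + 1}{2\beta}}\,
    \Gamma\!\left(\frac{3\beta\lambda - \lambda + 1}{2\beta}\right) \\
 &= \frac{4^{\lambda}}{2\,\pi^{\lambda/2}}\,\beta^{\lambda - 1}\,\alpha^{\frac{\lambda - 1}{2\beta}}\,
    \lambda^{-\frac{3\beta\lambda - \lambda + 1}{2\beta}}\,
    \Gamma\!\left(\frac{3\beta\lambda - \lambda + 1}{2\beta}\right).
\end{aligned}
$$

The exponent of $\alpha$ in the last line follows from $\frac{3\lambda}{2} - \frac{3\beta\lambda - \lambda + 1}{2\beta} = \frac{\lambda - 1}{2\beta}$. Taking logarithms and dividing by $1 - \lambda$ gives the Renyi entropy.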

3.2. A-entropy

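The A-entropy is also known as the $\beta$-entropy. The A-entropy of a random variable X having PDF (3) is defined as

$$A_E = \frac{1}{\lambda - 1} \left[ 1 - \int_{0}^{\infty} f^{\lambda}(x, \alpha, \beta)\,dx \right].$$

Using PDF (3), and after simplification, the expression for the $\beta$-entropy is given by

$$A_E = \frac{1}{\lambda - 1} \left[ 1 - \left(\frac{4}{\sqrt{\pi}}\right)^{\lambda} \frac{\beta^{\lambda - 1}}{2}\, \alpha^{\frac{\lambda - 1}{2\beta}}\, \lambda^{-\frac{3\beta\lambda - \lambda + 1}{2\beta}}\, \Gamma\!\left(\frac{3\beta\lambda - \lambda + 1}{2\beta}\right) \right]. \tag{8}$$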

3.3. Generalized entropy

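The generalized entropy is defined by

$$G_E = \frac{\nu_{\lambda}\, \mu^{-\lambda} - 1}{\lambda(\lambda - 1)}, \quad \lambda \neq 0, 1,$$

where $\nu_{\lambda} = \int_{0}^{\infty} x^{\lambda} f(x, \alpha, \beta)\,dx$ and $\mu = E(X)$. After some algebra, we get

$$G_E = \frac{\left(\frac{2}{\sqrt{\pi}}\right)^{1 - \lambda} \Gamma\!\left(\frac{3\beta + \lambda}{2\beta}\right) \left[\Gamma\!\left(\frac{3\beta + 1}{2\beta}\right)\right]^{-\lambda} - 1}{\lambda(\lambda - 1)}, \quad \lambda \neq 0, 1. \tag{9}$$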
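The entropy expressions of this section can be verified against numerical integration. The following sketch is an illustration under stated assumptions: the function names are hypothetical, and the closed form used for $\ln \int f^{\lambda}\,dx$ is the one obtained from PDF (3) with the substitution $u = \lambda\alpha x^{2\beta}$.

```python
import numpy as np
from scipy import integrate, special


def pmad_pdf(x, alpha, beta):
    """Density of PMaD(alpha, beta), equation (3)."""
    return (4.0 / np.sqrt(np.pi)) * alpha**1.5 * beta \
        * np.power(x, 3.0 * beta - 1.0) * np.exp(-alpha * np.power(x, 2.0 * beta))


def log_integral_f_power(lam, alpha, beta):
    """Closed form of ln( integral of f^lam over (0, inf) )."""
    g = (3.0 * beta * lam - lam + 1.0) / (2.0 * beta)
    return (lam * np.log(4.0) - 0.5 * lam * np.log(np.pi) - np.log(2.0)
            + (lam - 1.0) * np.log(beta)
            + (lam - 1.0) / (2.0 * beta) * np.log(alpha)
            - g * np.log(lam) + special.gammaln(g))


def renyi_entropy(lam, alpha, beta):
    """Renyi entropy of order lam (lam > 0, lam != 1)."""
    return log_integral_f_power(lam, alpha, beta) / (1.0 - lam)


def generalized_entropy(lam, alpha, beta):
    """Generalized entropy, equation (9); it does not depend on alpha."""
    ratio = (2.0 / np.sqrt(np.pi)) ** (1.0 - lam) \
        * special.gamma((3.0 * beta + lam) / (2.0 * beta)) \
        / special.gamma((3.0 * beta + 1.0) / (2.0 * beta)) ** lam
    return (ratio - 1.0) / (lam * (lam - 1.0))


if __name__ == "__main__":
    lam, alpha, beta = 2.0, 0.75, 1.5
    numeric, _ = integrate.quad(lambda x: pmad_pdf(x, alpha, beta) ** lam, 0.0, np.inf)
    print("ln integral:", np.log(numeric), log_integral_f_power(lam, alpha, beta))
    print("Renyi entropy:", renyi_entropy(lam, alpha, beta))
    print("generalized entropy:", generalized_entropy(lam, alpha, beta))
```

For $\lambda = 2$ the generalized entropy equals half the squared coefficient of variation, so with $\alpha = \beta = 0.5$ it should be $(6/9)/2 = 1/3$, which offers a simple hand check.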

4. Parameter estimation with a simulation study

Here, we describe the maximum likelihood and Bayes estimation methods for estimating the unknown parameters $\alpha$ and $\beta$ of the PMaD. The estimators obtained under these methods are not available in closed form; thus, numerical approximation techniques are used to obtain the solutions. Further, the performance of these estimators is studied through a Monte Carlo simulation.

4.1. Maximum likelihood estimation

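The most popular and efficient method of classical parameter estimation is maximum likelihood estimation. The estimators obtained by this method possess several desirable properties, such as consistency and efficiency. Let $X_1, X_2, \ldots, X_n$ be an iid random sample of size n taken from $PMaD(\alpha, \beta)$. The likelihood function is

$$L(\alpha, \beta) = \prod_{i=1}^{n} \frac{4}{\sqrt{\pi}}\,\alpha^{3/2} \beta\, x_i^{3\beta - 1} e^{-\alpha x_i^{2\beta}} = \frac{4^{n}}{\pi^{n/2}}\,\alpha^{3n/2} \beta^{n} e^{-\alpha \sum_{i=1}^{n} x_i^{2\beta}} \prod_{i=1}^{n} x_i^{3\beta - 1},$$

hence the corresponding log-likelihood function is written as

$$\ln L(\alpha, \beta) = l = n \ln 4 - \frac{n}{2} \ln \pi + \frac{3n}{2} \ln \alpha + n \ln \beta - \alpha \sum_{i=1}^{n} x_i^{2\beta} + (3\beta - 1) \sum_{i=1}^{n} \ln x_i. \tag{10}$$

The MLEs of $\alpha$ and $\beta$ are the solutions of $\frac{\partial l}{\partial \alpha} = 0$ and $\frac{\partial l}{\partial \beta} = 0$, that is,

$$\frac{3n}{2\alpha} - \sum_{i=1}^{n} x_i^{2\beta} = 0 \tag{11}$$

$$\frac{n}{\beta} - 2\alpha \sum_{i=1}^{n} x_i^{2\beta} \ln x_i + 3 \sum_{i=1}^{n} \ln x_i = 0. \tag{12}$$

The MLEs of the parameters are obtained by solving these two equations simultaneously; since no closed-form solution is available, a non-linear maximization technique is used to obtain the solution.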
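One way to carry out this maximization is sketched below in Python. It is not the authors' implementation (the paper uses the R function nlm()); the approach shown here is a swapped-in profile likelihood: equation (11) gives $\alpha = 3n / (2\sum x_i^{2\beta})$ for any fixed $\beta$, so only a one-dimensional search over $\beta$ remains. The function names and the search bounds are assumptions.

```python
import numpy as np
from scipy import optimize


def profile_alpha(beta, x):
    """alpha solving equation (11) for a given beta."""
    return 1.5 * x.size / np.sum(x ** (2.0 * beta))


def neg_profile_loglik(beta, x):
    """Negative of log-likelihood (10) with alpha replaced by profile_alpha."""
    n = x.size
    alpha = profile_alpha(beta, x)
    loglik = (n * np.log(4.0) - 0.5 * n * np.log(np.pi)
              + 1.5 * n * np.log(alpha) + n * np.log(beta)
              - alpha * np.sum(x ** (2.0 * beta))
              + (3.0 * beta - 1.0) * np.sum(np.log(x)))
    return -loglik


def pmad_mle(x, beta_bounds=(1e-3, 20.0)):
    """Return (alpha_hat, beta_hat); beta_bounds is an assumed search range."""
    x = np.asarray(x, dtype=float)
    res = optimize.minimize_scalar(neg_profile_loglik, bounds=beta_bounds,
                                   args=(x,), method="bounded")
    beta_hat = res.x
    return profile_alpha(beta_hat, x), beta_hat


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    # alpha * X**(2*beta) follows a Gamma(3/2, 1) law, which gives a simple sampler.
    sample = (rng.gamma(1.5, 1.0, size=200) / 0.75) ** (1.0 / (2.0 * 0.75))
    print(pmad_mle(sample))
```

Profiling out $\alpha$ keeps the search one-dimensional, which tends to be more stable than a joint two-parameter search when the sample is small.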

4.1.1 Uniqueness of MLEs

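The uniqueness of the MLEs discussed in the previous subsection can be checked by using the following propositions.

Proposition 1: If $\beta$ is fixed, then $\hat{\alpha}$ exists and is unique.

Proof: Let $L_{\alpha} = \frac{3n}{2\alpha} - \sum_{i=1}^{n} x_i^{2\beta}$. Since $L_{\alpha}$ is continuous, with $\lim_{\alpha \to 0} L_{\alpha} = \infty$ and $\lim_{\alpha \to \infty} L_{\alpha} = -\sum_{i=1}^{n} x_i^{2\beta} < 0$, $L_{\alpha}$ has at least one root in the interval $(0, \infty)$. Moreover, $\frac{\partial L_{\alpha}}{\partial \alpha} = -\frac{3n}{2\alpha^{2}} < 0$, so $L_{\alpha}$ is a strictly decreasing function of $\alpha$. Thus, $L_{\alpha} = 0$ has a unique solution in $(0, \infty)$.

Proposition 2: If $\alpha$ is fixed, then $\hat{\beta}$ exists and is unique.

Proof: Let $L_{\beta} = \frac{n}{\beta} - 2\alpha \sum_{i=1}^{n} x_i^{2\beta} \ln x_i + 3 \sum_{i=1}^{n} \ln x_i$. $L_{\beta}$ is continuous, with $\lim_{\beta \to 0} L_{\beta} = \infty$. As $\beta \to \infty$, $L_{\beta} \to -\infty$ if some $x_i > 1$, while $L_{\beta} \to 3 \sum_{i=1}^{n} \ln x_i$ if no $x_i$ exceeds 1, which is negative unless every $x_i = 1$. Moreover, $\frac{\partial L_{\beta}}{\partial \beta} = -\frac{n}{\beta^{2}} - 4\alpha \sum_{i=1}^{n} x_i^{2\beta} (\ln x_i)^{2} < 0$, so $L_{\beta}$ is strictly decreasing. This implies, as above, that $\hat{\beta}$ exists and is unique.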

4.1.2 Fisher Information Matrix

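Here, we derive the Fisher information matrix for constructing $100(1 - \gamma)\%$ asymptotic confidence intervals for the parameters using large sample theory. Using equations (11) and (12), the Fisher information matrix is obtained as

$$I(\hat{\alpha}, \hat{\beta}) = -E \begin{pmatrix} l_{\alpha\alpha} & l_{\alpha\beta} \\ l_{\beta\alpha} & l_{\beta\beta} \end{pmatrix}_{(\hat{\alpha}, \hat{\beta})},$$

where

$$l_{\alpha\alpha} = -\frac{3n}{2\alpha^{2}}, \quad l_{\alpha\beta} = l_{\beta\alpha} = -2 \sum_{i=1}^{n} x_i^{2\beta} \ln x_i, \quad l_{\beta\beta} = -\frac{n}{\beta^{2}} - 4\alpha \sum_{i=1}^{n} x_i^{2\beta} (\ln x_i)^{2}.$$

The above matrix can be inverted, and the diagonal elements of $I^{-1}(\hat{\alpha}, \hat{\beta})$ provide the asymptotic variances of $\hat{\alpha}$ and $\hat{\beta}$, respectively. The two-sided $100(1 - \gamma)\%$ asymptotic confidence intervals for $\alpha$ and $\beta$ are then obtained as

$$\alpha \in \left[\hat{\alpha} - z_{1 - \gamma/2} \sqrt{var(\hat{\alpha})},\ \hat{\alpha} + z_{1 - \gamma/2} \sqrt{var(\hat{\alpha})}\right], \quad \beta \in \left[\hat{\beta} - z_{1 - \gamma/2} \sqrt{var(\hat{\beta})},\ \hat{\beta} + z_{1 - \gamma/2} \sqrt{var(\hat{\beta})}\right].$$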
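The interval construction can be sketched in Python as follows. One assumption is made and should be kept in mind: the expectation in the Fisher information is replaced by the observed second derivatives evaluated at the estimates (the observed information), which is a common practical substitute but is not stated in the text. Function names are hypothetical.

```python
import numpy as np
from scipy import stats


def observed_information(alpha, beta, x):
    """Negative Hessian of the log-likelihood (10) at (alpha, beta)."""
    x = np.asarray(x, dtype=float)
    n = x.size
    lx = np.log(x)
    xp = x ** (2.0 * beta)
    l_aa = -1.5 * n / alpha**2
    l_ab = -2.0 * np.sum(xp * lx)
    l_bb = -n / beta**2 - 4.0 * alpha * np.sum(xp * lx**2)
    return -np.array([[l_aa, l_ab], [l_ab, l_bb]])


def wald_intervals(alpha_hat, beta_hat, x, level=0.95):
    """Rows: (lower, upper) limits for alpha and for beta."""
    cov = np.linalg.inv(observed_information(alpha_hat, beta_hat, x))
    z = stats.norm.ppf(0.5 + level / 2.0)   # z_{1 - gamma/2}
    se = np.sqrt(np.diag(cov))              # asymptotic standard errors
    est = np.array([alpha_hat, beta_hat])
    return np.column_stack([est - z * se, est + z * se])


if __name__ == "__main__":
    rng = np.random.default_rng(3)
    sample = (rng.gamma(1.5, 1.0, size=50) / 0.75) ** (1.0 / 1.5)
    # The true values stand in for the MLEs here, purely for illustration.
    print(wald_intervals(0.75, 0.75, sample))
```

The asymptotic confidence length (ACL) is then the difference between the upper and lower limits in each row.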

4.2. Bayes estimation

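In this subsection, the Bayes estimation procedure for the PMaD parameters is developed. We consider two independent gamma priors for the shape and scale parameters. The gamma prior is very flexible because it can take many different shapes. The joint prior $g(\alpha, \beta)$ is given by

$$g(\alpha, \beta) \propto \alpha^{a-1} \beta^{c-1} e^{-b\alpha - d\beta}; \quad \alpha, \beta > 0, \tag{13}$$

where a, b, c and d are the hyper-parameters of the considered priors. Using the likelihood function of the PMaD and the prior (13), the joint posterior density function $\pi(\alpha, \beta \mid x)$ is derived as

$$\pi(\alpha, \beta \mid x) = \frac{L(x \mid \alpha, \beta)\, g(\alpha, \beta)}{\int_{\alpha} \int_{\beta} L(x \mid \alpha, \beta)\, g(\alpha, \beta)\,d\alpha\,d\beta} = \frac{\alpha^{\frac{3n}{2} + a - 1} \beta^{n + c - 1} e^{-\alpha\left(b + \sum_{i=1}^{n} x_i^{2\beta}\right)} e^{-d\beta} \prod_{i=1}^{n} x_i^{3\beta - 1}}{\int_{\alpha} \int_{\beta} \alpha^{\frac{3n}{2} + a - 1} \beta^{n + c - 1} e^{-\alpha\left(b + \sum_{i=1}^{n} x_i^{2\beta}\right)} e^{-d\beta} \prod_{i=1}^{n} x_i^{3\beta - 1}\,d\alpha\,d\beta}. \tag{14}$$

In Bayesian analysis, the specification of a proper loss function plays an important role. We use the most frequently used squared error loss function (SELF) to obtain the estimators of the parameters, which is defined as

$$L(\phi, \hat{\phi}) \propto (\phi - \hat{\phi})^{2}, \tag{15}$$

where $\hat{\phi}$ is an estimate of $\phi$. The Bayes estimator under SELF is the posterior mean,

$$\hat{\phi}_{SELF} = E(\phi \mid x), \tag{16}$$

provided the expectation exists and is finite. Thus, the Bayes estimators based on the posterior (14) under SELF are given by

$$\hat{\alpha}_{bs} = E(\alpha \mid x) = \eta^{-1} \int_{\alpha} \int_{\beta} \alpha^{\frac{3n}{2} + a} \beta^{n + c - 1} e^{-\alpha\left(b + \sum_{i=1}^{n} x_i^{2\beta}\right)} e^{-d\beta} \prod_{i=1}^{n} x_i^{3\beta - 1}\,d\alpha\,d\beta \tag{17}$$

and

$$\hat{\beta}_{bs} = E(\beta \mid x) = \eta^{-1} \int_{\alpha} \int_{\beta} \alpha^{\frac{3n}{2} + a - 1} \beta^{n + c} e^{-\alpha\left(b + \sum_{i=1}^{n} x_i^{2\beta}\right)} e^{-d\beta} \prod_{i=1}^{n} x_i^{3\beta - 1}\,d\alpha\,d\beta, \tag{18}$$

where $\eta = \int_{\alpha} \int_{\beta} \alpha^{\frac{3n}{2} + a - 1} \beta^{n + c - 1} e^{-\alpha\left(b + \sum_{i=1}^{n} x_i^{2\beta}\right)} e^{-d\beta} \prod_{i=1}^{n} x_i^{3\beta - 1}\,d\alpha\,d\beta$ is the normalizing constant of the posterior.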

From equations (17) and (18), it is easy to observe that the posterior expectations appear as ratios of two integrals, so analytical solutions for these expectations are not available. Therefore, a numerical approximation technique must be used to obtain the solutions. Here, we use one of the most popular and effective approximation techniques, suggested by [11]. The detailed description is as follows.

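$$\widehat{(\alpha, \beta)}_{Bayes} = \frac{\int_{\alpha} \int_{\beta} u(\alpha, \beta)\, e^{l + \rho(\alpha, \beta)}\,d\alpha\,d\beta}{\int_{\alpha} \int_{\beta} e^{l + \rho(\alpha, \beta)}\,d\alpha\,d\beta} \tag{19}$$

$$\approx u(\hat{\alpha}, \hat{\beta})_{ml} + \frac{1}{2} \left[ (u_{\alpha\alpha} + 2 u_{\alpha} \rho_{\alpha}) \tau_{\alpha\alpha} + (u_{\alpha\beta} + 2 u_{\alpha} \rho_{\beta}) \tau_{\alpha\beta} + (u_{\beta\alpha} + 2 u_{\beta} \rho_{\alpha}) \tau_{\beta\alpha} + (u_{\beta\beta} + 2 u_{\beta} \rho_{\beta}) \tau_{\beta\beta} \right] + \frac{1}{2} \left[ (u_{\alpha} \tau_{\alpha\alpha} + u_{\beta} \tau_{\alpha\beta})(l_{30} \tau_{\alpha\alpha} + 2 l_{21} \tau_{\alpha\beta} + l_{12} \tau_{\beta\beta}) + (u_{\alpha} \tau_{\beta\alpha} + u_{\beta} \tau_{\beta\beta})(l_{21} \tau_{\alpha\alpha} + 2 l_{12} \tau_{\beta\alpha} + l_{03} \tau_{\beta\beta}) \right], \tag{20}$$

where $u(\alpha, \beta)$ is the function of the parameters to be estimated, $\rho(\alpha, \beta) = \ln g(\alpha, \beta)$ and $l = \ln L(\alpha, \beta \mid x)$, with all quantities evaluated at the MLEs and

$$l_{ab} = \frac{\partial^{3} l}{\partial \alpha^{a} \partial \beta^{b}}, \quad a, b = 0, 1, 2, 3, \quad a + b = 3, \quad \rho_{\alpha} = \frac{\partial \rho}{\partial \alpha}, \quad \rho_{\beta} = \frac{\partial \rho}{\partial \beta},$$

$$u_{\alpha} = \frac{\partial u}{\partial \alpha}, \quad u_{\beta} = \frac{\partial u}{\partial \beta}, \quad u_{\alpha\alpha} = \frac{\partial^{2} u}{\partial \alpha^{2}}, \quad u_{\beta\beta} = \frac{\partial^{2} u}{\partial \beta^{2}}, \quad u_{\alpha\beta} = \frac{\partial^{2} u}{\partial \alpha\, \partial \beta},$$

$$\tau_{\alpha\alpha} = -\frac{1}{l_{20}}, \quad \tau_{\alpha\beta} = -\frac{1}{l_{11}} = \tau_{\beta\alpha}, \quad \tau_{\beta\beta} = -\frac{1}{l_{02}},$$

where $l_{20} = l_{\alpha\alpha}$, $l_{11} = l_{\alpha\beta}$ and $l_{02} = l_{\beta\beta}$ are the second-order derivatives given in Section 4.1.2.

Since $u(\alpha, \beta)$ is a function of $\alpha$ and $\beta$:

• If $u(\alpha, \beta) = \alpha$ in (20), then

$$u_{\alpha} = 1, \quad u_{\beta} = 0, \quad u_{\alpha\alpha} = u_{\beta\beta} = 0, \quad u_{\alpha\beta} = u_{\beta\alpha} = 0.$$

• If $u(\alpha, \beta) = \beta$ in (20), then

$$u_{\beta} = 1, \quad u_{\alpha} = 0, \quad u_{\alpha\alpha} = u_{\beta\beta} = 0, \quad u_{\alpha\beta} = u_{\beta\alpha} = 0,$$

and the remaining derivatives based on the likelihood function are obtained as

$$l_{30} = \frac{3n}{\alpha^{3}}, \quad l_{11} = -2 \sum_{i=1}^{n} x_i^{2\beta} \ln x_i, \quad l_{03} = \frac{2n}{\beta^{3}} - 8\alpha \sum_{i=1}^{n} x_i^{2\beta} (\ln x_i)^{3},$$

$$l_{12} = \frac{\partial^{3} l}{\partial \alpha\, \partial \beta^{2}} = -4 \sum_{i=1}^{n} x_i^{2\beta} (\ln x_i)^{2}, \quad l_{21} = \frac{\partial^{3} l}{\partial \alpha^{2}\, \partial \beta} = 0.$$

Using these derivatives, the Bayes estimators of $(\alpha, \beta)$ are obtained from the expressions

$$\hat{\alpha}_{bl} = \hat{\alpha}_{ml} + \frac{1}{2} \left[ (2 u_{\alpha} \rho_{\alpha}) \tau_{\alpha\alpha} + (2 u_{\alpha} \rho_{\beta}) \tau_{\alpha\beta} \right] + \frac{1}{2} \left[ (u_{\alpha} \tau_{\alpha\alpha})(l_{30} \tau_{\alpha\alpha} + 2 l_{21} \tau_{\alpha\beta} + l_{12} \tau_{\beta\beta}) + (u_{\alpha} \tau_{\beta\alpha})(l_{21} \tau_{\alpha\alpha} + 2 l_{12} \tau_{\beta\alpha} + l_{03} \tau_{\beta\beta}) \right], \tag{21}$$

$$\hat{\beta}_{bl} = \hat{\beta}_{ml} + \frac{1}{2} \left[ (2 u_{\beta} \rho_{\alpha}) \tau_{\beta\alpha} + (2 u_{\beta} \rho_{\beta}) \tau_{\beta\beta} \right] + \frac{1}{2} \left[ (u_{\beta} \tau_{\alpha\beta})(l_{30} \tau_{\alpha\alpha} + 2 l_{21} \tau_{\alpha\beta} + l_{12} \tau_{\beta\beta}) + (u_{\beta} \tau_{\beta\beta})(l_{21} \tau_{\alpha\alpha} + 2 l_{12} \tau_{\beta\alpha} + l_{03} \tau_{\beta\beta}) \right].$$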
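A compact Python sketch of the two Lindley-type expressions is given below. Several choices in it are assumptions rather than the authors' implementation: the $\tau$ terms are taken as the entries of the inverse of the negative Hessian of the log-likelihood (the usual choice in Lindley's method, which may differ from the elementwise definitions written in the text), the mixed derivative $\partial^{3} l / \partial \alpha^{2} \partial \beta$ is set to zero because $\partial^{2} l / \partial \alpha^{2}$ does not depend on $\beta$, and the prior derivatives follow from the gamma prior (13). The function name is hypothetical.

```python
import numpy as np


def lindley_pmad(alpha_hat, beta_hat, x, a, b, c, d):
    """Approximate posterior means of alpha and beta under SELF."""
    x = np.asarray(x, dtype=float)
    n = x.size
    lx = np.log(x)
    xp = x ** (2.0 * beta_hat)

    # Second-order derivatives of the log-likelihood.
    l20 = -1.5 * n / alpha_hat**2
    l11 = -2.0 * np.sum(xp * lx)
    l02 = -n / beta_hat**2 - 4.0 * alpha_hat * np.sum(xp * lx**2)

    # Third-order derivatives.
    l30 = 3.0 * n / alpha_hat**3
    l21 = 0.0  # d^3 l / (d alpha^2 d beta) vanishes
    l12 = -4.0 * np.sum(xp * lx**2)
    l03 = 2.0 * n / beta_hat**3 - 8.0 * alpha_hat * np.sum(xp * lx**3)

    # ASSUMPTION: tau is the inverse of the negative Hessian.
    tau = np.linalg.inv(-np.array([[l20, l11], [l11, l02]]))
    t_aa, t_ab, t_bb = tau[0, 0], tau[0, 1], tau[1, 1]

    # Derivatives of the log of the gamma prior (13).
    rho_a = (a - 1.0) / alpha_hat - b
    rho_b = (c - 1.0) / beta_hat - d

    term_a = l30 * t_aa + 2.0 * l21 * t_ab + l12 * t_bb
    term_b = l21 * t_aa + 2.0 * l12 * t_ab + l03 * t_bb

    alpha_bl = alpha_hat + rho_a * t_aa + rho_b * t_ab + 0.5 * (t_aa * term_a + t_ab * term_b)
    beta_bl = beta_hat + rho_a * t_ab + rho_b * t_bb + 0.5 * (t_ab * term_a + t_bb * term_b)
    return alpha_bl, beta_bl


if __name__ == "__main__":
    rng = np.random.default_rng(5)
    sample = (rng.gamma(1.5, 1.0, size=50) / 0.75) ** (1.0 / 1.5)
    # Hyper-parameters chosen so that each prior has mean 0.75 and variance 0.5;
    # the true values stand in for the MLEs, purely for illustration.
    print(lindley_pmad(0.75, 0.75, sample, a=1.125, b=1.5, c=1.125, d=1.5))
```

The correction terms are of order 1/n, so for large samples the approximate Bayes estimates stay close to the MLEs, which is a useful sanity check on any implementation.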

4.3. Simulation study

In this section, a Monte Carlo simulation study is performed to assess the performance of the obtained estimators in terms of their mean square errors (MSEs). The MLEs of the parameters are evaluated by using the nlm() function, and the MLEs of the reliability characteristics are obtained by using the invariance property. The Bayes estimates of the parameters are evaluated by Lindley's approximation technique. The hyper-parameter values are chosen in such a way that the prior mean is equal to the true value, and the prior variance is taken to be small, say 0.5. All the computations are done in R 3.4.1. First, we generated 5000 random samples from the $PMaD(\alpha, \beta)$ using the Newton-Raphson algorithm for sample sizes n = 10 (small), n = 20, 30 (moderate) and n = 50 (large) with fixed ($\alpha = 0.75$, $\beta = 0.75$), and second for different values of ($\alpha$, $\beta$) with the sample size fixed at n = 20. Average estimates and mean square errors (MSEs) of the parameters are calculated for these choices, and the corresponding results are reported in Table 2. The asymptotic confidence intervals (ACIs) and asymptotic confidence lengths (ACLs) are also obtained and presented in Table 3. From this simulation study, it is observed that the precision of the MLEs and Bayes estimators increases as the sample size increases, while the average ACL decreases. The Bayes estimates under an informative prior are more precise than the MLEs, especially for small sample sizes, while for large samples the precision of the two estimators is almost the same for all the considered parametric choices.
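Random samples from the PMaD can also be drawn without iterative root finding. The sketch below is an alternative to the Newton-Raphson inversion mentioned above, not a description of the authors' code: it uses the fact, implied by CDF (4), that $\alpha X^{2\beta}$ follows a gamma distribution with shape 3/2 and unit scale. The function names are hypothetical.

```python
import numpy as np
from scipy import special


def rpmad(n, alpha, beta, rng):
    """Draw n PMaD(alpha, beta) variates via alpha * X**(2*beta) ~ Gamma(3/2, 1)."""
    g = rng.gamma(shape=1.5, scale=1.0, size=n)
    return (g / alpha) ** (1.0 / (2.0 * beta))


def pmad_mean(alpha, beta):
    """Theoretical mean, equation (6), used here only as a check."""
    return (2.0 / np.sqrt(np.pi)) * alpha ** (-1.0 / (2.0 * beta)) \
        * special.gamma((3.0 * beta + 1.0) / (2.0 * beta))


if __name__ == "__main__":
    rng = np.random.default_rng(2024)
    draws = rpmad(100_000, 0.75, 0.75, rng)
    print("sample mean:", draws.mean(), "theoretical mean:", pmad_mean(0.75, 0.75))
    print("empirical and exact CDF at 1:", np.mean(draws <= 1.0), special.gammainc(1.5, 0.75))
```

Because the transformation is exact, the generated samples can be used directly in a Monte Carlo loop over sample sizes and parameter settings such as those listed above.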

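Table 2: Average estimates and mean square errors (MSE rows) of the parameters and reliability characteristics based on simulated data.

| n | $\alpha, \beta$ | Quantity | $\hat{\alpha}_{ml}$ | $\hat{\beta}_{ml}$ | $M(t)_{ml}$ | $R(t)_{ml}$ | $H(t)_{ml}$ | $\hat{\alpha}_{bl}$ | $\hat{\beta}_{bl}$ |
|---|---|---|---|---|---|---|---|---|---|
| 10 | 0.75, 0.75 | Estimate | 0.5070 | 1.1598 | 1.5119 | 0.9691 | 0.1663 | 0.5063 | 1.1028 |
| 10 | 0.75, 0.75 | MSE | 0.0631 | 0.2588 | 0.0164 | 0.0049 | 0.0947 | 0.0631 | 0.2027 |
| 20 | 0.75, 0.75 | Estimate | 0.6560 | 0.8848 | 1.4922 | 0.9343 | 0.2965 | 0.6521 | 0.8647 |
| 20 | 0.75, 0.75 | MSE | 0.0098 | 0.0326 | 0.0093 | 0.0014 | 0.0703 | 0.0105 | 0.0263 |
| 30 | 0.75, 0.75 | Estimate | 0.7096 | 0.8064 | 1.4883 | 0.9163 | 0.3504 | 0.7058 | 0.7951 |
| 30 | 0.75, 0.75 | MSE | 0.0022 | 0.0103 | 0.0071 | 0.0004 | 0.0010 | 0.0025 | 0.0087 |
| 50 | 0.75, 0.75 | Estimate | 0.7542 | 0.7453 | 1.4869 | 0.8988 | 0.3968 | 0.7514 | 0.7397 |
| 50 | 0.75, 0.75 | MSE | 0.0003 | 0.0031 | 0.0046 | 0.0001 | 0.0003 | 0.0003 | 0.0031 |
| for fixed n and different $\alpha, \beta$ | | | | | | | | | |
| 20 | 0.5, 0.75 | Estimate | 0.6603 | 0.6832 | 1.7380 | 0.9044 | 0.3400 | 0.6574 | 0.6716 |
| 20 | 0.5, 0.75 | MSE | 0.0261 | 0.0125 | 0.0585 | 0.0017 | 0.0099 | 0.0252 | 0.0117 |
| 20 | 0.5, 1.5 | Estimate | 0.7290 | 0.3033 | 4.6222 | 0.7871 | 0.3556 | 0.7258 | 0.3229 |
| 20 | 0.5, 1.5 | MSE | 0.0528 | 1.4330 | 11.9171 | 0.0402 | 0.1139 | 0.0513 | 1.3866 |
| 20 | 1.5, 0.5 | Estimate | 0.5090 | 2.9297 | 1.1531 | 0.9983 | 0.0207 | 0.5517 | 2.8634 |
| 20 | 1.5, 0.5 | MSE | 0.9907 | 6.6465 | 0.0242 | 0.1274 | 26.0695 | 0.9087 | 6.3006 |
| 20 | 2.5, 2.5 | Estimate | 1.0448 | 0.5958 | 1.4084 | 0.7953 | 0.6393 | 1.2825 | 0.6727 |
| 20 | 2.5, 2.5 | MSE | 2.1402 | 3.6573 | 0.3860 | 0.0373 | 0.3553 | 1.5058 | 3.3715 |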

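Table 3: Interval estimates and asymptotic confidence length (ACL) of the parameters.

| n | $\alpha, \beta$ | $\alpha_L$ | $\alpha_U$ | $ACL_{\alpha}$ | $\beta_L$ | $\beta_U$ | $ACL_{\beta}$ |
|---|---|---|---|---|---|---|---|
| 10 | 0.75, 0.75 | 0.0874 | 0.9266 | 0.8393 | 0.5711 | 1.7485 | 1.1775 |
| 20 | 0.75, 0.75 | 0.3209 | 0.9911 | 0.6703 | 0.5525 | 1.2171 | 0.6646 |
| 30 | 0.75, 0.75 | 0.4263 | 0.9928 | 0.5665 | 0.5555 | 1.0574 | 0.5019 |
| 50 | 0.75, 0.75 | 0.5290 | 0.9794 | 0.4505 | 0.5631 | 0.9275 | 0.3644 |
| for fixed n and different $\alpha, \beta$ | | | | | | | |
| 20 | 0.5, 0.75 | 0.3255 | 0.9951 | 0.6696 | 0.4142 | 0.9523 | 0.5381 |
| 20 | 0.5, 1.5 | 0.3794 | 1.0785 | 0.6991 | 0.4819 | 1.7425 | 1.2429 |
| 20 | 1.5, 0.5 | 0.4206 | 1.7812 | 0.76058 | 0.2260 | 1.8334 | 1.3807 |
| 20 | 2.5, 2.5 | 0.5804 | 2.9509 | 0.9788 | 0.54133 | 2.7783 | 1.1365 |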

Figure 2: Empirical cumulative distribution function and QQ plot for the data set-I.

5. Practical data modeling

This section demonstrates the practical applicability of the proposed model in real-life scenarios, especially for survival/reliability data taken from different sources. The proposed distribution is compared with the Maxwell distribution (MaD) and its different generalizations, namely the length biased Maxwell distribution (LBMaD), see [9], the area biased Maxwell distribution (ABMaD), see [9], the extended Maxwell distribution (EMaD, labelled ExMaD in Table 4), see [8], and the generalized Maxwell distribution (GMaD), see [2]. For these models, the parameter estimates are obtained by the method of maximum likelihood, and the compatibility of the PMaD is assessed using model selection tools (which depend on the MLEs), namely the negative log-likelihood (-log L), the Akaike information criterion (AIC), the corrected Akaike information criterion (AICC), the Bayesian information criterion (BIC) and the Kolmogorov-Smirnov (K-S) test. In general, smaller values of these statistics indicate a better fit to the data.
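The likelihood-based criteria can be computed from the maximized log-likelihood as in the short sketch below. The formulas used are the standard textbook definitions and the function name is hypothetical; the AICc line in particular is an assumption, since the AICC values reported in Table 4 may follow a different convention.

```python
import math


def information_criteria(neg_loglik, k, n):
    """Standard AIC, BIC and small-sample corrected AIC.

    neg_loglik: value of -log L at the MLE, k: number of parameters,
    n: sample size.
    """
    aic = 2.0 * neg_loglik + 2.0 * k
    bic = 2.0 * neg_loglik + k * math.log(n)
    aicc = aic + 2.0 * k * (k + 1.0) / (n - k - 1.0)  # standard definition (assumption)
    return {"AIC": aic, "BIC": bic, "AICc": aicc}


if __name__ == "__main__":
    # -log L, number of parameters and sample size of the PMaD fit to the
    # bladder cancer data, as reported in Table 4.
    print(information_criteria(366.3820, k=2, n=128))
```

With these inputs the AIC and BIC agree with the PMaD row for the bladder cancer data in Table 4 up to rounding.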

The data sets are described as follows.

Data Set-I (Bladder cancer data): This data set represents the remission times (in months) of 128 bladder cancer patients, and it was initially used by [12]. The same data set was used by [8] to show the superiority of the extended Maxwell distribution.

Data Set-II (Item failure data): This data set is taken from [13]. It consists of 50 items put into use at initial time t = 0, with failure times recorded in weeks.

Data Set-III (Airborne communication transceiver): This data set was initially considered by [14]. It represents 46 repair times (in hours) for an airborne communication transceiver.

Data Set-IV (Flood data): The data are the exceedances of flood peaks (in m3/s) of the Wheaton River near Carcross in Yukon Territory, Canada. The data consist of 72 exceedances for the years 1958-1984, rounded to one decimal place. This data set was analyzed by [16].

A summary of the considered data sets is given in Table 5. The skewness is positive for all data sets, which makes them well suited to the proposed model. The kurtosis values in this table all exceed 3, so the data sets are leptokurtic, which the proposed model can also accommodate.

Table 4: Goodness of fit values for different models.

| Model | $\hat{\alpha}$ | $\hat{\beta}$ | -log L | AIC | AICC | BIC | K-S |
|---|---|---|---|---|---|---|---|
| Bladder cancer data, N = 128 | | | | | | | |
| PMaD | 0.7978 | 0.1637 | 366.3820 | 736.7639 | 732.8599 | 742.4680 | 0.3675 |
| MaD | 0.0076 | - | 1014.4440 | 2030.8870 | 2028.9190 | 2033.7400 | 0.4144 |
| LBMaD | 98.6386 | - | 669.3668 | 1340.7340 | 1338.7650 | 1343.5860 | 0.4906 |
| ABMaD | 78.9109 | - | 767.8122 | 1537.6240 | 1535.6560 | 1540.4770 | 0.5608 |
| ExMaD | 0.8447 | 1.4431 | 412.1232 | 828.2464 | 824.3424 | 833.9504 | 0.8265 |
| GMaD | 0.7484 | 527.2314 | 426.6019 | 857.2037 | 853.2997 | 862.9078 | 0.7086 |
| Item failure data, N = 50 | | | | | | | |
| PMaD | 0.8339 | 0.1820 | 135.8204 | 275.6407 | 271.8961 | 279.4648 | 0.2625 |
| MaD | 0.0104 | - | 367.8528 | 737.7056 | 735.7890 | 739.6177 | 0.4268 |
| LBMaD | 72.1146 | - | 315.1624 | 632.3248 | 630.4081 | 634.2368 | 0.5112 |
| ABMaD | 57.6917 | - | 374.1247 | 750.2494 | 748.3328 | 752.1615 | 0.5825 |
| ExMaD | 0.6186 | 1.0139 | 151.2998 | 306.5996 | 302.8550 | 310.4237 | 0.7327 |
| GMaD | 0.5400 | 534.1569 | 151.2643 | 306.5287 | 302.7840 | 310.3527 | 0.3920 |
| Airborne communication transceiver, N = 46 | | | | | | | |
| PMaD | 0.8735 | 0.2709 | 101.9125 | 207.8249 | 204.1040 | 211.4822 | 0.2136 |
| MaD | 0.0406 | - | 245.1383 | 492.2766 | 490.3675 | 494.1052 | 0.5027 |
| LBMaD | 18.4603 | - | 237.4945 | 476.9890 | 475.0799 | 478.8176 | 0.5771 |
| ABMaD | 14.7683 | - | 284.7017 | 571.4034 | 569.4943 | 573.2320 | 0.6324 |
| ExMaD | 0.7290 | 0.8672 | 103.3052 | 210.6104 | 206.8895 | 214.2677 | 0.2989 |
| GMaD | 0.6015 | 122.7666 | 110.8521 | 225.7042 | 221.9833 | 229.3615 | 0.4392 |
| River data, N = 72 | | | | | | | |
| PMaD | 0.805185 | 0.1504145 | 212.8942 | 429.7884 | 425.9623 | 434.3418 | 0.2760 |
| MaD | 0.005032 | - | 610.9235 | 1223.847 | 1221.904 | 1226.124 | 0.3821 |
| LBMaD | 149.0315 | - | 426.3076 | 854.6153 | 852.6724 | 856.8919 | 0.4113 |
| ABMaD | 119.2252 | - | 493.3271 | 988.6543 | 986.7114 | 990.9309 | 0.4529 |
| ExMaD | 0.697471 | 1.306933 | 251.9244 | 507.8487 | 504.0226 | 512.4021 | 0.7487 |
| GMaD | 0.648149 | 919.7356 | 251.2767 | 506.5534 | 502.7273 | 511.1068 | 0.4998 |

Table 5: Summary of the data sets.

| Data | Min | Q1 | Q2 | Mean | Q3 | Max | Kurtosis | Skewness |
|---|---|---|---|---|---|---|---|---|
| I | 0.080 | 3.348 | 6.395 | 9.366 | 11.838 | 79.050 | 18.483 | 3.287 |
| II | 0.013 | 1.390 | 5.320 | 7.821 | 10.043 | 48.105 | 9.408 | 2.306 |
| III | 0.200 | 0.800 | 1.750 | 3.607 | 4.375 | 24.500 | 11.803 | 2.888 |
| IV | 0.100 | 2.125 | 9.500 | 12.204 | 20.125 | 64.000 | 5.890 | 1.473 |

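Table 6: ML and Bayes estimates of the four data sets.

| Data | $\hat{\alpha}_{ml}$ | $\hat{\beta}_{ml}$ | $\hat{\alpha}_{bl}$ | $\hat{\beta}_{bl}$ |
|---|---|---|---|---|
| I | 0.7978 | 0.1637 | 0.7962 | 0.1639 |
| II | 0.8339 | 0.1820 | 0.8292 | 0.1821 |
| III | 0.8735 | 0.2709 | 0.8675 | 0.2703 |
| IV | 0.8052 | 0.1504 | 0.8023 | 0.1506 |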

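Table 7: Interval estimates based on the four data sets.

| Data | $\alpha_L$ | $\alpha_U$ | $ACL_{\alpha}$ | $\beta_L$ | $\beta_U$ | $ACL_{\beta}$ |
|---|---|---|---|---|---|---|
| I | 0.6545 | 0.9411 | 0.2866 | 0.1373 | 0.1902 | 0.0529 |
| II | 0.5962 | 1.0717 | 0.4754 | 0.1376 | 0.2263 | 0.0888 |
| III | 0.6202 | 1.1269 | 0.5067 | 0.2081 | 0.3337 | 0.1256 |
| IV | 0.6126 | 0.9978 | 0.3852 | 0.1186 | 0.1822 | 0.0636 |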

Figure 3: Empirical cumulative distribution function and QQ plot for the data set-II (the left panel overlays the empirical CDF with the CDF fitted by MLE).

Figure 4: Empirical cumulative distribution function and QQ plot for the data set-III (the left panel overlays the empirical CDF with the CDF fitted by MLE).

Figure 5: Empirical cumulative distribution function and QQ plot for the data set-IV.

From Table 4, it is clear that the proposed model (PMaD) has the smallest values of the model selection criteria, which reflects the merit of the PMaD over the existing versions of the Maxwell distribution for modeling these four practical data sets. The empirical cumulative distribution function (ECDF) plots and the corresponding QQ plots for all the considered data sets are shown for the PMaD in Figures 2-5. From the ECDF and QQ plots, it is clear that the proposed model fits the considered data sets adequately. The point (ML and Bayes) estimates of the parameters for each data set are reported in Table 6. The Bayes estimates are calculated under a non-informative prior, and the ML and Bayes estimates are observed to be almost the same. The interval estimates of the parameters and the corresponding asymptotic confidence lengths are presented in Table 7. This table shows that as the size of the data set increases, the length of the interval decreases because the standard error decreases, which supports our simulation results.

6. Conclusion

This article proposed the power Maxwell distribution (PMaD) as a flexible extension of the Maxwell distribution and studied some of its main properties for data modeling. We also studied the skewness and kurtosis of the PMaD and found that it is capable of modeling positively skewed as well as symmetric data. The unknown parameters of the PMaD are estimated by the maximum likelihood and Bayes estimation methods. The MLEs of the reliability function and hazard function are also obtained by using the invariance property. The 95% asymptotic confidence intervals for the parameters are constructed using the Fisher information matrix. The MLEs and Bayes estimators are compared through a Monte Carlo simulation, and it is observed that the Bayes estimators are more precise under an informative prior. Finally, medical/reliability data have been used to show the practical utility of the PMaD, and it is observed that it provides a better fit than other versions of the Maxwell distribution. Thus, it can be recommended as an alternative model for data with non-monotone failure rates.

Acknowledgements

The authors are very grateful to the editor and reviewers for their recommendation to publish this article in this reputed journal. The first author gratefully acknowledges Banaras Hindu University, India, for providing financial support in the form of a Seed grant under the Institute of Eminence Scheme (scheme no. Dev. 6031).

References

[1] Tyagi, R. K., and Bhattacharya, S. K. (1989). A note on the MVU estimation of reliability for the Maxwell failure distribution. Estadistica 41:73-79.

[2] Chaturvedi, A., and U. Rani. (1998). Classical and Bayesian reliability estimation of the generalized Maxwell failure distribution. Journal of Statistical Research 32:113-20.

[3] Bekker, A., and Roux, J. J. (2005). Reliability characteristics of the Maxwell distribution: A Bayes estimation study. Communications in Statistics - Theory and Methods 34:2169-78.

[4] Radha R. K. and Vekatesan P. (2013). On the double prior selection for the parameter of Maxwell distribution, International Journal of Scientific & Engineering Research, Volume 4, Issue 5.

[5] Shakil M., Golam B. M. K. and Chang K. C. (2008). Distributions of the product and ratio of Maxwell and Rayleigh random variables, Statistical Papers, 49:729-747.

[6] Dey, S., and Maiti, S. S. (2010). Bayesian estimation of the parameter of Maxwell distribution under different loss functions. Journal of Statistical Theory and Practice 4:279-287.

[7] Tomer, S. K., and Panwar, M. S. (2015). Estimation procedures for Maxwell distribution under type I progressive hybrid censoring scheme. Journal of Statistical Computation and Simulation 85:339-356.

[8] Sharma, V. K., Bakouch, H. S. and Khushboo Suthar, K. (2017a). An extended Maxwell distribution: Properties and applications, Communications in Statistics - Simulation and Computation, 46:9, 6982-7007.

[9] Sharma, V. K., Dey, S., Singh, K. S. and Manzoor, U. (2017b). On Length and Area biased Maxwell distributions, Communications in Statistics - Simulation and Computation, DOI: 10.1080/03610918.2017.1317804.

[10] Renyi A. (1961). On measures of entropy and information, in: Proceedings of the 4th Berkeley Symposium on Mathematical Statistics and Probability, University of California Press, Berkeley.

[11] Lindley, D. V. (1980). Approximate Bayes method, Trabajos de estadistica, Vol. 31, 223-237.

[12] Lee, E. T. and Wang, J. W. (2003). Statistical Methods for Survival Data Analysis. Wiley, New York, DOI:10.1002/0471458546.

[13] Murthy D. N. P., Xie M., Jiang R. (2004). Weibull model. Wiley, New York.

[14] Chhikara, R. S. and Folks, J. L. (1977). The inverse gaussian distribution as a lifetime model, Technometrics 19: 461-468.

[15] Bonferroni C. E. (1930). Elementi di Statistica Generale, Seeber, Firenze.

[16] Choulakian, V., Stephens, M. A. (2001). Goodness-of-fit tests for the generalized Pareto distribution, Technometrics 43(4), 478-484.
