INFERENCE ON THE INVERSE POWER BURR-HATKE DISTRIBUTION UNDER TYPE II CENSORING
Pavitra Kumari
Department of Statistics, Central University of Haryana, India [email protected]
Vinay Kumar
Department of Mathematics and Statistics, Chaudhary Charan Singh Haryana Agricultural
University, India [email protected]
Abstract
There are many real-life situations, where data require probability distribution function which have decreasing or upside-down bathtub (UBT) shaped failure rate function. The inverse power burr hatke distribution consists both decreasing and UBT shaped failure rate functions. Here, we address the different estimation methods of the parameter and reliability characteristics of the inverse Pareto distribution from both classical and Bayesian approaches. We consider classical estimation procedures to estimate the unknown parameter of inverse power burr-hatke distribution, such as maximum likelihood. Also, we consider Bayesian estimation using squared error loss function based joint priors. The Monte Carlo simulations are performed to compare the performances of the obtained estimators in mean square error sense. Finally, the flexibility of the proposed distribution is illustrated empirically using one real-life datasets. The analyzed data shows that the introduced distribution provides a superior fit than some important competing distributions such as the Weibull, inverse Pareto and Burr-Hatke distributions.
Keywords: Burr-Hatke Distribution, Inverse Power Burr- Hatke Distribution, Type II censoring, Bayesian estimation, Lindley's Approximation technique.
I. Introduction
Statistical distributions can be used to model many real-life scenarios, such as reliability, actuarial science, survival analysis and lifetime data. Different lifetime distributions have been introduced in the statistical literature to provide greater flexibility in modelling data in these applied sciences. One of the important features of generalized distributions is their capability for providing superior fit for various life-time data encountered in the applied fields. Hence, the statisticians have been interested in constructing new families of distributions to model such data.
Recently, several new distributions and regression models to provide inferences on these distributions have been developed for modeling health and biomedical data, among other fields. Some distributions and classes of distributions developed include exponentiated Burr XII Poisson distribution by da Silva et al. [1], Weibull Burr XII (WBXII) distribution by Afify et al. [2], odd log logistic Topp-Leone G family of distributions by Alizadeh et al. [3], Burr-Hatke exponential (BHE)
distribution by Abouelmagd [4] and Yadav et al. [5], odd generalized gamma-G family of distributions by Nasir et al. [6], Chen-G family of distributions by Anzagra et al. [7], inverse-power Burr-Hatke distribution by Afify et al. [8], harmonic mixture Weibull-G family of distributions by Zamanah et al. [9], harmonic mixture G family of distributions by Kharazmi et al. [10] and Alshenawy R. [11] studied Progressive Type-II Censoring Schemes of Extended Odd Weibull Exponential Distribution with Applications in Medicine and Engineering. Ahmed et. al. [12] studied Bayesian and Classical Inference under Type-II Censored Samples of the Extended Inverse Gompertz Distribution with Engineering Applications. Hassan [13] studied Statistical Inference of Chen distribution Based on Two Progressive Type-II Censoring Schemes.
Burr Hatke model provides only a decreasing hazard rate (HR) shape; hence, its use will be limited to modelling the data that exhibits only increasing failure rate. IPBH model can accommodate right-skewed shape, symmetrical shape, reversed J shape and left-skewed shape densities. Its hazard rate (HR) can be an increasing shape, a unimodal shape, or a decreasing shape. IPBH distribution provides more accuracy and flexibility in fitting engineering and medicine data. The IPBH distribution was constructed using the inverse-power (IP) transformation. The aim of this article is to develop the classical and Bayesian estimation procedures for the parameters of the IPBH. The rest of the article is organized as follows: IPBH is discussed in Section 2. Also, mathematical formulation is given for type II censoring with failure and censoring time distributions in this section. Section 3 deals with the maximum likelihood estimation and asymptotic confidence intervals of the parameters. Section 4 describes asymptotic confidence interval. Sections 5 describe the formulation of Bayes estimation procedure using Markov chain Monte Carlo (MCMC) methods under SELF loss function using gamma informative priors. Section 6 deals with a Monte Carlo simulation study to explore the properties of various estimates developed in this article. Real life dataset is analyzed for illustration purposes in Section 7. Finally, conclusive remarks are given in section 8. Also, it is essential to mention that the statistical software R 3.5.2, [R Core Team (2018)] is used for computation purposes throughout the article.
II. The Model
If a random variable X follows IPBH with parameter (A, 9) the cdf is given by:
F(x:№= exp(~" x-9), ie>o
v J x-s+1
Therefore, the corresponding probability density function is given by
6 exv(-Ax~0)[A + (1 + X)]x~e f(x:d,X)=-—- . , ---, ¿,8>0
x(x° + 1)2
(2.1) (2.2)
Where 9 and A are shape parameters, respectively.
Figure 1. Possible density shapes of the IPBH distribution for several values of A and 6.
The survival function (SF) and HR function of the IPBH distribution take the following forms, respectively:
S(x: A,6) = 1- (2.3)
_ 9[Л + (Л+1)х~в]
h(X] A'в) = + 1)[(** + 1) exp(-Ax-e) - x0] (2.4)
Figure 2. Possible failure rate shape of the IPBH distribution for values of A =0.75 and 6 = 3
Figure 3. Possible failure rate shape of the IPBH distribution for values of A =0.5 and 6 = 0.25
Figure 4. Possible failure rate shape of the IPBH distribution for values of A = 10 and 6 = 0.75
III. Maximum Likelihood Estimation
In the literature, Several censoring schemes have been discussed. Even though, Type-I and Type- II censoring schemes are most popular censoring.Consider a life test where n independent units taken from a IPBH distribution are placed under observation and failure time of each unit is recorded.Suppose that the test is terminated when rth, (1 < r < n), r is prefixed unit fails.These
observed failure times, say (x^ x2,....., xr) is a Type-II censored sample of size r. In this censoring
scheme n-r units remain unobserved and survive beyond the time of termination.In Type-II censoring the time of termination is a random variable nad the likelihood function based on (xp x2,...., xr) is given by Cohen [14].
¿Mx)=(^nL/^1 - F(x«)]n-r (2.5)
Assume that n independent observed values taken of IPBH distribution as presented in (2) are put on a test.Using the Type-II censoring, we obtained the ordered r failures. If the ordered r failures are
then (xx,x2,.....,xr) the likelihood function of (A,9) under Type-II censored data drawn of an IPBD
distribution, is obtained as follows:
n
(n-r)I It=oJ
n T—rr r
¿m*) = (^IL/^1 - F(x«)]
r
L(A,0|x) = rZo#(60 - +^log[A+ (A+ 1)xf] -2^log(xf + 1) - ^log(xj) - ^
¿=i
MLEs of A and 9 is a solution of equation (2.5) accomplished by addressing the first partial derivatives of the total log-likelihood to be zero.So,we consider the equation as follows,
r
dZogL y xf +1 i y e (n - r)
= 1 +yx*+.
dA ^A+(A+1)xf £ (^ + 1)^ - xfr)
¿=i ¿=i v J 1 ¿=i 1
The closed form solutions to the nonlinear Equations are difficult to reach and a numerical method must be applied to solve these simultaneous equation for obtaining the MLE of A and 9.
IV. Asymptotic Confidence Intervals
The maximum likelihood estimators of the unknown parameters are not in closed form, it is not easy to drive the exact distributions of the MLEs. Thus, we use the asymptotic distribution of MLEs for the constructions of asymptotic confidence intervals of the parameters based on observed Fisher information matrix. Let a= ( X, 0), be the MLE of a = (A, 0).The observed Fisher information matrix is given by:
rdlnL(8,A) dlnL(8,A)"|
dA2 dAd8
dlnL(8,A) dlnL(8, A)
d8dA d82 dlnL(8,A) ^ (xf+1)2 (rc-r)(xfr) + 1)x(er)eAx--)
= 1
+
2
^ ((A + (A + 1)xf)) - (Xer) + 1)e^)
r_elog (xt) (A+1)xf(xf 1^(A+(A+1)xf (A+(^+1)xf)2 ; f=i
Thus, the observed variance-covariance matrix becomes /"(a).The asymptotic distribution of MLE
3lnL(8,A)_V / x_elog (x,) (A+1)xf(xf+ 1)log (xQ\ y _ _
d8dA = Z,U+tt+1)x? fnrui^2 I Z,*' ( '0g(Xi))
r
=1
=1
=1
Pavitra Kumari, Vinay Kumar RT&A, No 1 (77) INFERENCЕ ON THE INVERSE POWER BURR-HATKE DISTRIBUTION..._Volume 19, March 2024
a is a bivariate normal distribution as aN (0, /-(a)). Consequently, two sided equal tailed 100(1-^) asymptotic confidence intervals for the parameters A and 9 are given by A + Z^var(A) and
0 + Z^ jvar(0)
respectively. Here, Var (A) and Var (0) are diagonal elements of the observed
th
2 ~ ~ v 2'
variance-covariance matrix / (a) and Z^ is the upper (z^) percentile of the standard normal distribution.
V. The Bayesian Estimation
In this section, we discuss the Bayes estimators of the unknown parameters of the model in (2) under square error loss function (SELF). In order to select the best decision in decision theory, an appropriate loss function must be specified. SELF is generally used for this purpose. The use of the SELF is well justified when over estimation and under estimation of equal magnitude has the same consequences. When the true loss is not symmetric with respect to over estimation and under estimation, asymmetric loss functions are used to represent the consequences of different errors. If all parameters of the model are unknown, a joint conjugate prior for the parameters does not exist. In such conditions there are numerous ways to choose the priors. Hence, we choose to consider the piecewise independent priors. The proposed priors for the parameters A and 9 may be taken as:
5l(A) = Aai-1e-Afci, a1,ft1> 0
Thus, the joint prior distribution of A and 9 can be written as:
5(A, 0) = A°i-10ai-1 e-(Afci+efci) (4.1)
Now we derive the Bayes estimators for the unknown parameters A and 9 under squared error loss function. If is the parameter to be estimated by an estimator fi then the squared error loss function is defined as fi ) =(^ - fi)2. The joint posterior distribution of A and 9 after simplification is:
t—■—)Aai-10a2-1e(A&1 + 0&2)nr=o/(*)(! - F(*))n-r
n(A,0|x)=-- (4.2)
J0 /0 (^--7) A«i-10«2-1e(A&1 + 0^) nF=o/(*i)(1 - F(*))"-r 5A50
Therefore, the Bayes estimator of any function of A and 0, say a(A,0 ) under squared error loss function is. I. Subsection One Lindley's Approximation
It is difficult to compute Eq. (4.2) analytically. Lindley's [15] approximation is used to compute the ratio of integrals of the form Eq. (4.3). Based on Lindley's approximation, the approximate Bayes estimator of A under the squared error loss function is:
Bindley = ^ + 1 [^1(2P1a11 + 2P2^21 + tf21L111 + 2a12a21L111 + ^11^22L211 + ^12^22L222)] (4.4) 12
0lindley = 0 + J [^2(2P2^22 + 2P1^21 + ^2L222 + 2a12°11L111 + 3^12^22L122)] (4.5)
Here L(A, 0) is the log-likelihood and p(A, 0) is the log of prior distribution rc(A, 0), A and 0 are the MLEs of A and 9 respectively.
VI. Simulation Study
This section deals with a Monte Carlo simulation study. Here, we compare various estimators developed in the previous sections with the help of Monte Carlo simulation study. Six different sample sizes n = 50, 60, 70, 80 and 90 are considered in the simulation study. Following combination
2
of the true values of the parameters (A, 9) = (0.5, 1) and (A, 9) = (1.5, 1) are taken. In each case the ML and Bayes estimates of the unknown parameters are computed. The whole process is simulated 1000 times. Tables 1-2 report the simulation results including Average Estimate (AE), MSE of the IPBH parameters.
Table 1: Bayes estimate of the parameter A and 6 when 6 = 1 and A = 0.5
n r Priori Prior2 Prior1 Prior2
0
AE MSE AE MSE AE MSE AE MSE
50 46 0.5332 0.018 0.5714 0.0152 1.0862 0.0082 1.0778 0.2459
50 48 0.5321 0.0137 0.5318 0.0142 1.0571 0.0715 1.0754 0.0821
60 56 0.5263 0.0124 0.5268 0.0122 1.0655 0.00615 1.0553 0.00567
60 58 0.5195 0.0102 0.5257 0.0099 1.0525 0.0516 1.0529 0.0588
70 66 0.5173 0.0089 0.5224 0.0091 1.0491 0.0506 1.0551 0.0511
70 68 0.5171 0.0084 0.5223 0.0092 1.0468 0.0485 1.0492 0.0492
80 76 0.5168 0.0071 0.5152 0.007 1.0423 0.0447 1.0468 0.0429
80 78 0.5156 0.0061 0.5187 0.0074 1.0387 0.0394 1.0271 0.0366
90 86 0.5078 0.0054 0.5162 0.0063 1.0311 0.0343 1.0327 0.0364
90 88 0.5115 0.0049 0.511 0.0053 1.0296 0.0316 1.0329 0.0349
Table 2: Bayes estimate of the parameter A and 6 when 6 = 1 and A = 1.5
n r Prior1 Prior2 Prior1 Prior2
0
AE MSE AE MSE AE MSE AE MSE
50 46 1.7311 0.464 1.7088 0.394 1.0504 0.0528 1.0498 0.0492
50 48 1.6536 0.2447 1.6543 0.2456 1.0369 0.0407 1.0439 0.0423
60 56 1.6038 0.1654 1.6382 0.1643 1.0295 0.0313 1.0453 0.0338
60 58 1.5855 0.1197 1.5290 0.1250 1.0244 0.0286 1.0306 0.0298
70 66 1.5841 0.1195 1.5771 0.1181 1.0258 0.0248 1.0221 0.0258
70 68 1.5723 0.0956 0.1181 0.0922 1.0243 0.0244 1.0202 0.0231
80 76 1.5636 0.0958 1.5771 0.1127 1.0143 0.0199 1.0285 0.0237
80 78 1.573 0.0856 1.5639 0.0827 1.0228 0.0198 1.0191 0.002
90 86 1.5614 0.0821 1.5587 0.0792 1.0186 0.0182 1.0162 0.0198
90 88 1.5534 0.0712 1.5489 0.0710 1.0199 0.0171 1.0276 0.0175
VII. Real-Life Applications
In this section, we illustrate estimation procedures discussed in the previous sections with the help of one real datasets. Here, we consider a real dataset namely the strengths of glass fibres The Data I, respectively are given below: Data set:
This dataset consists of 63 observations which are generated to simulate the strengths of glass fibres [18].The 63 observations of the dataset are as follows: "1.014, 1.081, 1.082, 1.185, 1.223, 1.248, 1.267, 1.271, 1.272, 1.275, 1.276, 1.278, 1.286, 1.288, 1.292, 1.304, 1.306, 1.355, 1.361, 1.364, 1.379, 1.409, 1.426, 1.459, 1.460, 1.476, 1.481, 1.484, 1.501, 1.506, 1.524, 1.526, 1.535, 1.541, 1.568, 1.579, 1.581, 1.591, 1.593,
1.602, 1.666, 1.670, 1.684, 1.691, 1.704, 1.731, 1.735, 1.747, 1.748, 1.757, 1.800, 1.806, 1.867, 1.876, 1.878, 1.910, 1.916, 1.972, 2.012, 2.456, 2.592, 3.197, and 4.121".
We calculate MLEs of the unknown parameters together with some useful measure of goodness-of fit tests for one dataset, namely, the negative log likelihood function -lnL, the Akaike information criterion denoted by AIC = 2k-2lnL, proposed by Akaike [16] and Bayesian information criterion denoted by BIC = kln(n)-2lnL, proposed by Schwarz [17], where k is the number of parameters in the model, n is the number of observations in the given datasets, L is the maximized value of the likelihood function for the estimated model and Kolmogorov-Smirnov (K-S) statistic with its p-value. The best distribution corresponds to the lowest -lnL, AIC, BIC and K-S statistic and the highest p values. The K-S statistic with its p-value is obtained using ks test function in statistical software R. The results of the MLEs and measures of goodness-of-fit tests are reported in Tables 3 and 4, respectively. These results show that IPBH distribution is the best choice for the considered datasets. However, for Data I, according to K-S test IPBH is better than the BH.
Table 3: Data Summary for the Data Set
Min 1st Qu. Median mean 3rd Qu. Max
1.014 1.305 1.526 1.616 1.741 4.121
Table 4: Goodness of Fit criterions on the data set
Distribution Estimates -logL AIC BIC K-S (stat) P-value
IPBD 0 = 5.7408 X =6.03415 15.403 26.8066 39.0929 0.08507 0.7197
BR § =0.2325 0 113.364 222.7286 235.0148 0.77052 < 0.001
Weibull § = 3.0521 § =1.7873 43.254 96.7345 101.12 0.2051 0.009
Exponential § =0.6189 93.222 187.432 190.523 0.4721 < 0.001
VIII. Conclusion
This article deals with the classical and Bayesian estimation procedures for parameters of inverse power Burr-Hatke distribution using second type censoring. The maximum likelihood estimators and corresponding asymptotic confidence intervals based on observed Fisher information matrix of the unknown parameters were derived. The Bayes estimates of the parameters under square error loss function were approximated using Lindley's approximation. The performance of these estimators was examined by extensive Monte Carlo simulation study, which indicated that the MLEs can be obtained easily and quickly with satisfactory estimates. For more efficient estimators, Bayes estimation method with available prior information or convenient non-informative priors in the absence of prior information is recommended.
References
[1] Da Silva, R. V., Gomes-Silva, F., Ramos, M. W. A., & Cordeiro, G. M. (2015). The exponentiated Burr XII Poisson distribution with application to lifetime data. International Journal of Statistics and Probability, 4(4): 112-131.
[2] Afify, A. Z., Cordeiro, G. M., Ortega, E. M., Yousof, H. M. and Butt, N. S. (2018). The four
parameter Burr XII distribution: properties, regression model, and applications. Communications in Statistics-Theory and Methods, 47(11): 2605-2624.
[3] Alizadeh, M., Lak, F., Rasekhi, M., Ramires, T. G., Yousof, H. M., and Altun, E. (2018). The odd log-logistic Topp-Leone G family of distributions: heteroscedastic regression models and applications. Computational Statistics, 33(3): 1217-1244.
[4] Abouelmagd, T. H. M. (2018). The Logarithmic Burr-Hatke Exponential Distribution for Modeling Reliability and Medical Data. International Journal of Statistics and Probability, 7(5): 73-85.
[5] Yadav, A. S., Altun, E. and Yousof, H. M. (2021). Burr-Hatke Exponential Distribution: A Decreasing Failure Rate Model, Statistical Inference and Applications. Annals of Data Science, 8(2): 241-260.
[6] Nasir, M. A., Tahir, M. H., Chesneau, C., Jamal, F., and Shah, M. A. A. (2020). The odds generalized gamma-G family of distributions: Properties, regressions and applications. Statistica, 80(1): 3-38.
[7] Anzagra, L., Sarpong, S., and Nasiru, S. (2020). Chen-G class of distributions. Cogent Mathematics and Statistics, 7(2):1721.
[8] Afify, A. Z., Aljohani, H. M., Alghamdi, A. S., Gemeay, A. M. and Sarg, A. M. (2021). A new two-parameter burr-hatke distribution: properties and bayesian and non-bayesian inference with applications. Journal of Mathematics. 2021: 16. [9] Zamanah, E., Nasiru, S., and Luguterah, A. (2022). Harmonic Mixture Weibull-G Family of Distributions: Properties, Regression and Applications to Medical Data. Computational and Mathematical Methods, 2022: 24.
[10] Khaazmi, O., Nik, A. S., Hamedani, G. G., and Altun, E. (2022). Harmonic Mixture-G Family of Distributions: Survival Regression, Simulation by Likelihood, Bootstrap and Bayesian Discussion with MCMC Algorithm. Austrian Journal of Statistics, 51(2): 1-27.
[11] Altun, E., Alizadeh, M., & Yousof, H. M. (2022). The Odd Log-Logistic Weibull-G Family of Distributions with Regression and Financial Risk Models. Journal of the Operations Research Society of China, 10(1): 133-158.
[12] Ahmed, E., Hassan M. A. and Ahmed, Z. A. (2021). Bayesian and Classical Inference under Type-II Censored Samples of the Extended Inverse Gompertz Distribution with Engineering Applications. Entropy, 23: 1578.
[13] Hassan, M. A. (2021). Statistical inference of chen distribution based on two progressive type-ii censoring schemes. Computers, Materials & Continua, 66: 2797-2814.
[14] Cohen, A. C. (1965). Maximum Likelihood Estimation in the Weibull Distribution Based on Complete and Censored Samples. Technometrics, 7: 579-588.
[15] Lindley, D. V. (1980). Approximate Bayesian methods (with discussions), Trabajos de Estadistica. 31: 232-245.
[16] Akaike, H. A. (1974). New look at the statistical model identification. IEEE Transactions on Automatic Control, 19: 716-723.
[17] Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6: 461-464.