Научная статья на тему 'A NOVEL EXTENSION OF INVERSE EXPONENTIAL DISTRIBUTIONS: A HEAVY-TAILED MODEL WITH UPSIDE DOWN BATHTUB SHAPED HAZARD RATE'

A NOVEL EXTENSION OF INVERSE EXPONENTIAL DISTRIBUTIONS: A HEAVY-TAILED MODEL WITH UPSIDE DOWN BATHTUB SHAPED HAZARD RATE Текст научной статьи по специальности «Науки о Земле и смежные экологические науки»

CC BY
29
10
i Надоели баннеры? Вы всегда можете отключить рекламу.
Ключевые слова
Inverse Exponential distribution / Modi Inverse Exponential distribution / Moments / Tail Behaviour / Order Statistics / Parameter estimation

Аннотация научной статьи по наукам о Земле и смежным экологическим наукам, автор научной работы — Jabir Bengalath, Bindu Punathumparambath

Heavy-tailed distributions have garnered interest due to their advantageous statistical and reliability characteristics, rendering them valuable in applied fields such as economics, finance, and risk management. Such distributions offer robust properties, making them pertinent to studies in various areas like econometrics, statistics, and insurance. Thus, the primary objective of this paper is to propose a Two parameter right skewedupside down bathtub type, heavy tailed distribution, which is a generalisation of Inverse Exponential distribution and is referred to as Modi Inverse Exponential distribution. We derive several mathematical and statistical features, including quantile function, mode, median, skewness, kurtosis, and mean deviation. Additionally, the reliability and hazard rate functions are also derived. Stochastic ordering and order statistics of the proposed distribution were derived. We also investigate the tail behaviour of the proposed model. Furthermore, estimation methods such as maximum likelihood estimation and its asymptotic confidence bound, percentile method, and Cramer-von-Mises method were examined. To demonstrate the appropriateness of the suggested model, we have considered two distinct real datasets along with three distinct models and concluded that the proposed model is more adaptable.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Текст научной работы на тему «A NOVEL EXTENSION OF INVERSE EXPONENTIAL DISTRIBUTIONS: A HEAVY-TAILED MODEL WITH UPSIDE DOWN BATHTUB SHAPED HAZARD RATE»

A NOVEL EXTENSION OF INVERSE EXPONENTIAL DISTRIBUTIONS: A HEAVY-TAILED MODEL WITH UPSIDE DOWN BATHTUB SHAPED HAZARD RATE

Jabir Bengalath

Department of Statistics, Govt. Arts and Science College, Calicut, University of Calicut, Kerala, India.

jabirbengu@gmail.com

Bindu Punathumparambath

Department of Statistics, Govt. Arts and Science College, Calicut, University of Calicut, Kerala, India.

ppbindukannan@gmail.com

Abstract

Heavy-tailed distributions have garnered interest due to their advantageous statistical and reliability characteristics, rendering them valuable in applied fields such as economics, finance, and risk management. Such distributions offer robust properties, making them pertinent to studies in various areas like econometrics, statistics, and insurance. Thus, the primary objective of this paper is to propose a Two parameter right skewed- upside down bathtub type, heavy tailed distribution, which is a generalisation of Inverse Exponential distribution and is referred to as Modi Inverse Exponential distribution. We derive several mathematical and statistical features, including quantile function, mode, median, skewness, kurtosis, and mean deviation. Additionally, the reliability and hazard rate functions are also derived. Stochastic ordering and order statistics of the proposed distribution were derived. We also investigate the tail behaviour of the proposed model. Furthermore, estimation methods such as maximum likelihood estimation and its asymptotic confidence bound, percentile method, and Cramer-von-Mises method were examined. To demonstrate the appropriateness of the suggested model, we have considered two distinct real datasets along with three distinct models and concluded that the proposed model is more adaptable.

Keywords:Inverse Exponential distribution, Modi Inverse Exponential distribution, Moments, Tail Behaviour, Order Statistics, Parameter estimation.

1. Introduction

During the recent years, heavy-tailed distributions have gained attention as an attractive subject for various research and studies. References to some works on these distributions can be found in [1],[2],[3],[4],[5]. These distributions possess excellent statistical and reliability properties, making them practical for many applied sciences such as economics, finance, econometrics, statistics, risk management, and insurance. Several authors have developed inferential results under financial modeling, as seen in [6],[7],[8],[9],[10]. There exist various heavy-tailed distributions in many practical situations, such as financial sciences, reliability engineering and bio-medical science,data are usually positive, and their distribution is uni-modal hump shaped and extreme values yielding heavier tails than the classical models. For example, in health science research (1). The medical

expenditure that exceed a given threshold and (2). Length of stay in hospital, presents highly skewed, heavy tailed data for which standard classical distributions and simple variable transform are insufficient to provide an adequate fit to such data. The Exponential distribution has been widely used for analyzing life-time data. However, its usefulness is restricted to scenarios with a constant hazard rate, which can be difficult to justify in practical situations. To address this problem, so many alternative models for life-time data have been developed. Among them, distributions like Weibull and gamma have been extensively employed when dealing with lifetime data exhibiting a monotonically increasing or decreasing hazard rate. The UBT failure rate distributions commonly appear in medical and biological fields like in lung cancer patient data (see [11], in bladder cancer patient data (see [28] ) and in breast carcinoma patient data (see [12]. The inverse transform method is a widely used approach to derive the inverse form of different lifetime distributions. The distribution family obtained by this method, known as the generalized inverted family, often exhibits the characteristic"upside-doun bathtub" hazard rate pattern. These distributions have the advantage of being the number of parameters required and are straightforward to apply. For instance, Notable examples of such a distribution include the inverted gamma distribution (IED) proposed by Lin et al[13] and the Inverse Lindley Distribution (ILD), introduced by Sharma et al[14]. In addition,The transmuted inverse exponential distribution was presented by Oguntunde and Adejumo [15]. In the same year, Khan et al [16] propose the transmuted inverse Weibull distribution. In addition, in 2014, Sharma et al [17]introduce the transmuted inverse Rayleigh distribution. Then, in 2016, Sharma et al [18] further introduced the generalized inverse Lindley distribution. The purpose of this study is to propose a new inverted probability model with UBT type of failure rate. For this purpose , we consider A one parameter Inverse exponential distribution. Let X be the random variable having the probability density function (pdf) is given by

9 e

g(x) = —2 e-x, x > 0, 6 > 0. (1)

x

and the cumulative distribution function (cdf) is

G(—) = e-—, — > 0, 6 > 0. (2)

for all x > 0 , where 6 > 0. Furthermore, K. Modi et al [19] introduced a new family of distributions name Modi family of distribution in his paper titled a new family of distributions with applications on two real data sets on the survival problem. where he proposed a new probability distribution by taking base line distribution by exponential distribution with one parameter. This paper aims to substitute a^ with y within the established family, resulting in a modified probability density function(pdf) and cumulative distribution function(cdf) is given by.

f (x) = (1 + 7)(7g(x)) Y > 0 (3)

fY(x) = (7 + G(x))2 , y > a (3)

and

FY(x)==(-1+G—1 Y > 0. (4)

where g(x) and G(x) are the pdf and cdf of the baseline distribution. Moreover, We can easily verified that the given Modi family satisfies the identifiable properties and other properties which is required for a probability distribution. Hence, this familiy of distribution can be used to generate more flexible probability distributions. The hazard function of equation (1) is given by,

hY (x)

1 + y

Y + G(x)

hx(x), Y > 0.

Where hx(x) is the hazard function of the baseline distribution.

Acknowledging the need for more flexible lifetime distributions, we introduce a new family of probability distribution known as the Modi Inverse Exponential distribution with two parameters. which can be extensively used to fit and analyze data in a variety of field. The paper is

organized as follows: Section 2 specifies the Modi Inverse Exponential distribution, whereas Section 3 provides the Modi Inverse Exponential distribution Properties, which include the Hazard function, Survival function, Quantile function,Mode, Median, Skewness, Kurtosis, Mean Deviation, Stochastic Ordering, Order statistics. In Section 4, we look at the Modi inverted exponential distribution's tail behaviour. Section 5 We investigated different method of estimation which includes Maximum likelihood estimation and its asymptotic confidence bound, Percentile method and Cramer-von Mises method is discussed Section 5, we conduct a simulation study to validate the proposed model's estimations, and two real data sets are analysed to demonstrate the efficacy of the proposed model. The conclusion is provided in Section 6.

2. Modi Inverse Exponential Distribution

A random variable X is said to have Modi Inverse Exponential distribution (MIE) if its cumulative distribution function (cdf) is given by

F(x) = (y + , x > 0, y > 0, 9 > 0. (5)

1 + ye*

and its pdf is,

f (x) = 9y(y+ 1)gf , x > 0, y > 0, 9 > 0. (6)

(xyex + x)2

The shape of the distribution might provide important insights into its characteristics, such as whether it is symmetrical or skewed. In this context, the MIE(y, 9) distribution is represented by its cumulative distribution function (cdf) in Figure 2 and its probability density function (pdf) in Figure 1 for different values parameter.

Figure 1: Pdf plot of MIE distribution for different parameter values

Theorem 1. Given that X follows the MIE (y, 9) distribution with f (x) and F(x) as given in (6) and (5) respectively, then:

Figure 2: cdf plot of MIE distribution for different parameter values

1. f (x) = 0

2. limx^M F(x) = 1

proof. Trivial and hence omitted.

3. Some Statistical Properties of MIE Distribution

This section contains various statistical features associated with the new distribution that have been derived.

3.1. Hazard Rate Function

The hazard function characterizing a specific phenomenon elucidates the inherent nature of the failure rate that is associated with lifetime of the specific equipment. For the cdf and pdf provided in equations (5) and (6), respectively, the expression for h(x) is as follows:

m = I7 +!,1*!+n (7)

x;(ex - 1)(7ex + 1)

The hazard rate function plot in Figure 3 shows various curves indicating different values of the parameters 7 and 9. We can gain useful insights about the nature of the model's failure rate by using this visual depiction, which reveals a distinct right-skewed pattern and UBT type failure model.The mathematical verification of this assertion may also be established through the utilization of the outcome presented by Glaser [20]. Glaser demonstrated that the Condition for UBT can be established if and only if the following conditions are met: f(t) > 0 for all t € (0, to), f(to) = 0, f(t) < 0 for all t > t0, and satisfying limt^0 f (t) = 0 where f is equal to — and f (t) is the first derivative of the density function with respect to t. For our proposed model, it is evident that.

f(t) = — ;t(1 + gf Y)+e9 — Y9

t; (1 + et y)

and

;((t + e Uy)2 — e 9 y9; + t(9 — e29 y; 9)

cp' (t)

t4(1 + e ? 7)2

Figure 3: Hazard plot ofMIE distribution for different parameter values

Since, the above equation is not provided in an explicit form to derive the solution, a simulation study was performed.It was observed that for to ~ 0.4488, q>'(t) > 0 for all t € (0, to), q>'(t0) = 0, q>'(t) < 0 for all t > t0. Also, from Equation (6), we verified that limx^0 f (t) = 0. Therefore, it can be deduced that the MIE(y,9) distribution proposed exhibits a right-skewed distribution, which is characterized by an UBT shape of hazard rate. This distribution is particularly useful when analyzing medical and reliability data.

3.2. Survival Function

The survival function describes the probability that a unit, component, or individual will not fail at a given time. The expression for survival function S(x) is stated as follows, and its corresponding survival plot is presented in Fiqure 4:

S(x) = , (8)

yex +1

Theorem 2. The limit of the hazard rate function of MIE(y, 9) distribution as x ^ to is zero.

i.e, lim h(x) = 0.

Proof. Trivial and hence omitted. ■

3.3. The Odd Function

The Odd Function is obtained using the relation Q(x) = ^x) and is given by

Q(x) = , (9)

y(e* - 1)

Figure 4: Survival plot of MIE distribution for different parameter values

3.4. Reverse Hazard Rate Function

The Revised Hazard Rate Function is obtained by using the relation,

$(x)

fJA

F(x) djex

x2(jex +1)

3.5. Cumulative Hazard Function

The Cumulative Hazard Function is obtained using the relation,

C(x) = - log(S(x))

- log

j(ex -1)

« , 1

jex +1

3.6. Quantile Function, Skewness and Kurtosis

The MIE distribution can be simulated using the inverse cdf method,

X

0

ln

î7+1-

V Yu

(10)

(11)

(12)

where, u is a uniform random variable, 0 < u < 1. The qth quantile of the MIE distribution is

obtained as:

xq —

0

(^J

ln

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

(13)

By making use of equation (13), we are able to calculate the first and third quartiles by substituting q = 0.25 and q = 0.75, correspondingly. Once we have obtained these values, we can subsequently

u

calculate Galton's [[21]] skewness (Sk) and Moor's [[22]] Kurtosis (Kr) by means of the given formulae.

The measure of skewness Sk,

S = Q(6/8) - 2Q(4/8) + Q(2/8)

S Q(6/8) - Q(2/8) , ( )

and the measure of Kurtosis, Kr

K = Q(7/8) - Q(5/8) + Q(3/8) - Q(1/8) (15)

Kr = Q(6/8) - Q(2/8) , (15)

3.7. Median

Since the distribution proposed is a heavily tailed, right-skewed distribution, the most appropriate measure of central tendency is the median.The median of the proposed distribution can be obtained by utilizing q = 0.5 in the quantile function, as delineated in equation (13). So median

Md

v e

Md

ln (7+1-0-5

m\ 7(0-5)

3.8. Mode

(16)

if a random variable X has the PDF given by equation (6), then the corresponding mode is given by f'(x) = 0, thus we obtain

ex9y(1 + y) (ex (9 - 2x) - y(9 + 2x)) f (x) =-^-T3-Z = 0

x4

(e°x + Yj x4 (ex (9 - 2x) - y(9 + 2x) j = 0

For various values of y and 9, we can estimate the value of x by using an optimization technique in R. If y = 3 and 9 = 4, we obtain the mode as 1.93755.

3.9. Mean Deviation

The mean deviation from the median is a statistical measure, serves as an indicator of population dispersion. Let "M" stand in for the median of the MIE Distributions specified in equation (16). The mean deviation from the median may be computed as follows:

p(x) = E\x - M| = j |x - M|f (x) dx,

it can be obtained the following equation p = ^ - 2W(M) where W(M) is

r M ex

W (M) = 9y(y + 1) —^^-- dx, (17)

j0 x2(yex +1)2

This integral may be readily computed numerically using tools such as R, MATLAB, Mathcad, and others. Thus, obtaining the mean deviation from the median is desired.

Figure 5: Mode plot of MIE distribution for different parameter value

3.10. Stochastic Ordering

Let X! and X; be random variables with cumulative distribution functions (cdf's) F1(x) and F;(x), respectively. X! is said to be stochastically greater than or equal to X; if Fi(x) < F; (x) for all x. (see Gupta et al [;3] for more detail).

Theorem 3. Let X! ~ MIE(71,91) and X; ~ MIE(y;,9;). X! is said to be stochastically greater than X; if 71 = 7; = 7 and 91 > 9;.

Proof. Let's consider 9i > 9; and 71 = 7; = 7, the ratio simplifies to:

7+1

F1(x) _ 1+7e^

F;(x)

1+7e x

Since 1 > 2, we have:

h

e x ^ > 1

e x

Therefore, when 91 > 9; and 71 = 7; = 7, F1 (x) is stochastically smaller than F;(x) for all

x > 0. ■

3.11. Order Statistics

In this section, we derive a compact expression for the pdf of the ith order statistic of the Modi inverse exponential distribution. Let X1, X;, X3,..., Xn be a simple random sample from the Modi inverse exponential distribution with cdf and pdf given by equations (5) and (6), respectively. Let X1:n < X2:n < X3:n < ... < Xn:n denote the order statistics obtained from this sample. We now give the pdf of Xr:n, denoted as fr:n (x), and the rth moments of Xr:n, for i = 1,2,..., n, which are given by:

fr:n(x) = Cr:n [F(x; 7,9)]r—1 [1 — F(x; 7,9)]n—rf (x; 7,9), (18)

for all x > 0, where F and f are given by equations (5) and (6), respectively, and Cr:n = (n-1y(n-r)i• Thus, using the binomial series expansion:

to /„, 1 \

(1 - x)a-1 = £(-iyfa . ^xj• j=0 \ J /

We obtain:

TO

s=0 v S

fr:n(x) = Cr:n £(-1)s( s ) [F(x; Y,0)]r+s-1 f (x; Y,Q),

r+s-1

TO 'n-r

Cr:n £ (-1)

s=0 v s

Y +1

1 , ^ 1 + Y^x

qy(y + 1)ex (xy^X + x)2

Cr:n £(-1)^n r) QY(Y + 1)r+s(1 + YeX )r+s-1 eX (xYeX + x)-2. (19)

s=0 s

4. Tail Area Property

According to Klugman et al [24]and Nair et al [25] a distribution is classified as a heavy-tailed distribution when it displays the heavy tail property. These types of distributions are characterized by the lack of one or more orders of moments. Specifically, the absence of the first moment, which represents the distribution's arithmetic mean, indicates the presence of the distribution's heavy tail property. The proposed distribution's arithmetic mean can be derived by solving:

Q

fro ex

Qy(y + 1) --e-dx,

j0 x(Yex +1)2

which is a divergent integral,then the arithmetic mean of the corresponding distribution cannot be determined. Consequently, based on this criterion, the proposed distribution can be classified as a heavy-tailed distribution. Another method for evaluating the heavy tail attribute of the distribution is to examine whether the ratio of the hazard rate to x approaches zero as x approaches infinity; if it does, then the distribution displays the characteristic of a heavy-tailed distribution. For the proposed distribution:

(Y + 1)QeX ^ 0 x3(ex - 1)(Yex + 1)

This fact can be proven by implementing L'Hpital's rule. As a result, the distribution put forward exhibits a heavy-tailed distribution.

In our analysis, we additionally consider the examination of the heavy tail characteristic of the distribution via an alternative methodology. This methodology involves the observation of the ratio of two survival functions. If the ratio of survivals approaches infinity as x approaches infinity, then one survival function is considered to be heavier than the other. Moreover, the limiting case of the ratio of two survival functions provides the limiting case of two probability density functions. Therefore, this ratio also indicates the ratio of two density functions.

lim SjM = lim S1M = lim

x^ro S2(x) x^to S2(x) x^ro f2(x)

here, we conduct a comparison between the proposed distribution and the Pareto Type II distribution. The survival function of the Pareto Type II distribution is expressed as S(x) = P(X > x) = (1 + j) a. For a > 1, the ratio between the two distributions tends to infinity as x approaches infinity. This suggests that the tail of the proposed distribution. Figure 6 provides the plot of the tail density for the proposed distribution in comparison with two other distributions, namely the normal distribution and the Parato Type II distribution.

Figure 6: Tail behaviour of Normal, Parato type II and MIE densities Remark 1: A distribution is said to be heavy-tailed distribution if and only if

!• TO

J eAxf (x) dx = to for all A > 0. Hence MIE(7,0) is heavy tailed becuase it satisfies condition.

Remark 2: if a distribution is said to be heavy-tailed, if the right probabilities are heavier than the exponential distribution if

1 — F(x)

lim -e-^- = to for all A > 0.

x^œ e

-Ax

Hence MIE(7,0) distribution also satisfies this condition.

Definition: An ultimately positive function f is called regularly varying at infinity with index 7 £ R if for any fixed c > 0:

lim ^ = cY.

x^to x

The aforementioned theorem demonstrates that the density function derived from the MIE(y,0) in equation (6) exhibits regularly varying tails.

Theorem 4. The density function of the MIE(y,0) distribution is a function with regularly varying tails.

Proof. Using the density function (6), we have:

lim KCXl = lim eCXx2(YeX + 1)2 = 1 f (x) eX (x + cx)2(7ecx + 1)2 '

applying limits, the above simplifies to:

r f(cx) , lim i^L = 1,

x^œ f (x)

Hence the proof. ■

Definition: An ultimately positive function f is long-tailed and is said to belong to class L if and only if:

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

lim f Ay) = 1, for all y > 0.

x^TO f (x)

Theorem 5. The MIE(y,0)Distribution belongs to the class L. Proof.

lim f (x + y) = e^x2(ye* + 1)2 = 1

^ f (x) el (x + y)2(ye*+y + 1)2 ,

Hence, f belongs to the class L. ■

Definition: An ultimately positive function f belongs to the class D of dominated variation distributions if there exists c > 0 such that:

f(x)

lim r\ \ = c, for all x > 0. x^to f (2x)

Theorem 6. The MIE(y,A) Distribution belongs to the class D of dominated variation distributions. Proof.

lim KxL = (yeA +1)2 = 4

x-moo f (2x) A / A )2 , J v ' e2x lyex + 11

hence f belongs to the class of dominated variation distributions. ■

4.1. Different method of Estimation

In this section, we are looking at three estimation methods for estimating the unknown model

parameters of the proposed model. The procedures are detailed below

4.2. Maximum Likelihood Estimation

Let X be a random variable with the pdf of the Modi inverse exponential distribution defined in equation (6), then its log-likelihood function is defined by:

log L(x; y, d) = n loge + n log y + n log(y + 1)

1 A (20)

+ e E - - 2 E log(x¿yexi + xi) V '

i=0 x' i=0

Thus, the non-linear normal equations are given as follows:

n 1 n

E¿ -2E-

i=0xi i=0 (xi + exi yxi)

a log L(x; y, e) = n + E 1 2 E ex' y

ae =e+E x- 2 E~ (21)

d log L(x; y, 9) n n „A exixi

g L = Y + i + y - 2 ^-x^ (22)

oy y 1 + y i=0 (Xi + exi yxi)

The equations from (21) to (22) above are not in closed form. We refer to using some iterative procedure such as Newton Raphson, Bisection methods, or some other method to obtain the approximate maximum likelihood estimates (MLE) of these parameters for the solution of these explicit equations.

4.3. The Asymptotic Confidence Bounds:

The maximum likelihood estimators (MLE) of the unknown parameters j, 9 in the MIE (7, 9) do not have closed-form solutions. As a result, the exact distribution of the MLE cannot be derived. However, asymptotic confidence bounds can be obtained based on the asymptotic distribution of the MLE. The information matrix is calculated by taking the second partial derivatives of equations (21) to (22) and is given as:

a2 log L(x; j, 9)_ n " -- - 92 - 2 L

d92

a2 log L(x; j, 9) _ n

e i y e i y

p + p~

0\ (Xi + exi YXi )2 Xi (Xi + exi jxi )i

dj2

a2 log L(x; j, 9)

aj9

- 2 L I-

29 2

e xi xi2

j2 (1 + j)2 i-0 y (xi + exi jxi)2

-2 L I-

e xï jxi e xi

p ++ iT

(23)

(24)

(25)

(xi + exi jxi )2 (xi + ex jxi )i

So that the observed information matrix is given by:

I - -

^11 I12 I21 I22

Hence, the variance-covariance matrix is approximated as:

V&I-

V11 V12" V21 V22

To obtain the estimate of V, we replace the parameters by their corresponding maximum likelihood estimators (MLEs) to get:

Vn V12" V21 V22

V

Using the above variance-covariance matrix, one can derive the (1 — fi) 100% confidence intervals for the parameters d and 7 as follows:

j ± Zj\Jvar(j), 9 ± Zj \Jvar(9)

4.4. The Percentile Method

Let X(j) be the jth order statistic, i.e., X(1) < X(2) < ... < X(n). if pj denote some estimate of F(xj; d, y), then the estimate of d and 7 can be obtained by minimizing

L

j-1

xj -

(Y+1-Pj\ V YPj J

ln

with respect to d and 7. Several types of estimators for pj can be used [26], and this paper considers pj = n+T.

4.5. Method of Cramer-von Mises

Cramer-von-Mises type minimum distance estimators aim to minimize the disparity between the theoretical and empirical cumulative distribution functions. It has been demonstrated empirically

n

1

2

9

that these estimators have a lesser bias than other minimum distance estimators. The Cramer-von-Mises estimators yCME, and 9CME, are the values of y, and 9 that minimizing

1

W (7,0) = ^ + E

i=1

F (ti ; 7,0) -

2i - 1 2n

(26)

where ti is the i-th ordered observation, and F(ti; 7,0) is the cumulative distribution function of the MIED with parameters 7 and 0. To estimate the parameters, we differentiate the above equation partially with respect to the parameters 7 and 0, respectively, and equate them to zero to get the normal equations. Since the normal equations are nonlinear, we can use iterative methods to obtain the solutions.

2

n

5. Simulation Study and Data Analysis

5.1. Simulation Study

In this section, we conduct a comprehensive Monte Carlo simulation study that is repeated 1000 times in order to compare the performance of the previously discussed estimators. We evaluate these estimators using Mean Squared Error (MSE) and make comparisons across sample sizes n = 50,100,150,200 for two distinct parameter settings: y = 1 and 9 = 0.05, and y = 0.9 and 9 = 2.5. The simulation-based outcomes provide estimates denoted as yPm, 9pm, yml, 9ml, ycm, and 9cm for the Percentile Method (PM), Maximum Likelihood Estimation (MLE), and Cramer-von Mises (CVM) method. The corresponding MSE values are displayed in parentheses. Notably, as sample size n increases, the Mean Squared Error (MSE) tends to decrease, indicating improved estimation accuracy. Table 1 and Table 2 present the simulation results.

n PM MLE CVM

7pm 0PM 7Ml 0ML 7cm 0CM

50 0.0836 0.0203 0.0013 0.0206 0.0900 0.0102

(0.3486) (6.918 x 10- -04) (8.709 x 10- -04) (5.277 x 10- -04) (0.4250) (0.0050)

100 0.0406 0.0097 0.0005 0.0102 0.0425 0.0049

(0.1623) (1.303 x 10- -04) (6.057 x 10- -04) (1.144 x 10- -04) (0.1819) (0.0026)

150 0.0269 0.0065 0.0004 0.0067 0.0282 0.0033

(0.1069) (6.499 x 10- -05) (1.357 x 10- -04) (5.123 x 10- -05) (0.1195) (0.0017)

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

200 0.0203 0.0049 0.0004 0.0050 0.0213 0.0024

(0.0804) (3.696 x 10- -05) (4.004 x 10- -05) (3.308 x 10- -05) (0.0896) (0.0013)

Table 1: Simulation outcomes obtained for the parameter value of y = 1 and 9 = 0.05 are presented herein. The values enclosed within the parentheses denote the Mean Squared Error (MSE) values.

5.2. Data Analysis

In this section, we demonstrate the usefulness of the proposed Modi Inverse Exponential distribution with parmeter y and 9 . We fit this distribution to a real-life data set and compare the results with some recent efficient models, namely the Inverse Generalized Weibull distribution, Generalized Inverse Generalized Weibull distribution. The corresponding PDFs are presented below:

• Inverse Generalized Weibull Distributions:

f (x,a,A) = apA?e-(A )p x-(p+1) (1 - e-(A)PJ \

• Generalized Inverse Generalized Weibull Distribution:

n PM MLE CVM

7pm 9pm 7Ml 9ML 7Cm C M

50 0.0564 0.0725 1.9823 0.05313 0.0065 0.0362

(3.926 x 10- -03) (3.08 x 10— 02) (0.2345) (3.09 x 10— 02) (4.036 x 10- -03) (3.103 x 10- -°2)

100 0.02762 0.0354 1.2356 0.0256 0.0029 0.0177

(2.591 x 10- -05) (2.044 x 10- -04) (0.0083) (2.055 x 10- -04) (2.688 x 10- -05) (2.064 x 10- -04)

150 0.01835 0.0232 0.8017 0.0168 0.0019 0.0116

(2.536 x 10- (2.024 x 10- -06) (0.0072) (2.048 x 10- -06) (2.682 x 10- (2.053 x 10- -06)

200 0.0138 0.0174 0.2211 0.0127 0.0014 0.0087

(4.744 x 10- 09) (3.928 x 10- -08) (0.0037) (3.991 x 10- -08) (5.322 x 10- -09) (4.046 x 10- -08)

Table 2: Simulation outcomes obtained for the parameter value of 7 = 0.9 and 9 = 2.5 are presented herein. The values enclosed within the parentheses denote the Mean Squared Error (MSE) values.

f (x, a, fi, A, 7) = e—7( A)fix—(fi+1) (1 — e—y(A)fiJ \

Data Set 1: This data set has been taken from [27]. The data on survival of 40 patients suffering from leukemia, from the Ministry of Health Hospitals in Saudi Arabia, was taken from

Abouammoh et al. (1994):

115 181 255 418 441 461 516 739 743 789 807 865 924 983 1024 1062 1063 1165 1191 1222 1222 1251 1277 1290 1357 1369 1408 1455 1478 1549 1578 1578 1599 1603 1605 1696 1735 1799 1815 1852

Table 3: Estimates and Goodness-of-fit measures based on AIC, BIC, AICC, and CAICfor Data Set 1

Distribution Estimates Log-Likelihood AIC BIC AICC CAIC

MIE 7= 0.1350 -326.978 656.822 660.199 657.146 660.199

= 1.7856

IGWD a = 0.0426 -346.170 698.340 703.406 699.006 703.406

fi = 1.3546

= 2.7048

GIGWD a= 0.0323 -367.455 742.909 749.664 744.052 749.664

fi= 0.9067

= 1.2614

c = 4.2107

From Table 3, it shows that the proposed Modi Inverse Exponential distribution model has the lowest AIC, BIC, AICC, and CAIC values among the other distributions, suggesting that it provides the best fit to the dataset.

Data Set 2: This data set represents survival times in Days, from a Two-Arm Clinical Trial considered by [28] and [29]. The survival time in days for the 31 patients from Arm B are:

37 84 92 94 110 112 119 127 130 133 140 146 155 159 173 179 194 195 209 249 281 319 339 432 469 519 633 725 817 1557 1776

From Table 4, we can see that our proposed model MIE has minimum AIC, BIC, AICC, and CAIC values compared to IGWD and GIGWD distributions. Thus, we can infer that the newly proposed model is a better fit for the given data compared to the other models.

Table 4: Estimates and Goodness-of-fit measures based on AIC, BIC, AICC, and CAICfor Data Set 2

Distribution Estimates Log-Likelihood AIC BIC AICC CAIC

MIE y= 0.0618 -206.657 417.315 420.183 417.744 420.183

= 1.1340

IGWD a= 0.0596 -217.366 440.733 445.035 441.622 445.035

P = 1.8165

= 4.5081

GIGWD a= 0.0665 -217.015 442.032 447.767 443.570 447.767

P = 0.8546

= 4.9719

c= 1.0867

6. Conclusion

In this article, We establishes the Modi Inverse Exponential distribution which a right skewed heavy tailed UBT shaped probability model. The related structural properties are derived and represented in the respective sections. Furthermore, we explore the tail behavior of the suggested model and conclude that it is heavy-tailed. To estimate the distribution's parameters, different estimation methods such as method of maximum likelihood, method percentile and Method of Cramer-von Mises are used. For the simulated data set, the results are shown in Table 1 and 2. We can see that the estimated values obtained are near to the predefined parameters, and as n increases, MSE decreases, confirming the law of large numbers. However, the application to two real-life data sets shows that the MIE distribution has a better fit than other competing models, such as the Inverse Generalized Weibull distribution (IGWD) and Generalized Inverse Generalized Weibull distribution, based on goodness-of-fit measures AIC, BIC, AICC, and CAIC.

References

[1] Venturini, S., Dominici, F. and Parmigiani, G. (2012). Gamma shape mixtures for heavy-tailed distributions. Annals of Applied Statistics, 2(2):756-776.

[2] Ickowicz, A. and Sparks, R. (2017). Modelling hospital length of stay using convolutive mixtures distributions. Statistics in Medicine, 36(1):122-135.

[3] Harini, S., Subbiah, M. and Srinivasan, M. R. (2017). Fitting length of stay in hospitals using transformed distributions. Communications in Statistics - Case Studies, Data Analysis and Applications, 4(1):1-8.

[4] Karagrigoriou, A. and Vonta, I. (2014). Statistical Inference for Heavy-Tailed Distributions in Technical Systems. In 2014 Ninth International Conference on Availability, Reliability and Security :412-419.

[5] Jayakumar, K. and Fasna, K. (2022). On a New Generalization of Cauchy Distribution. Asian Journal of Statistical Science, 2(1): 61-81.

[6] Bhati, D. and Ravi, S. (2018). Evolution by gene duplication. On generalized log-Moyal distribution: A new heavy-tailed size distribution. Insurance: Mathematics and Economics, 79:247-259.

[7] Beirlant, J., Matthys, G. and Dierckx, G. (2001). Heavy-tailed distributions and rating. Astin Bulletin, 31(1):37-58.

[8] Resnick, S. I. (1997). Discussion of the Danish data on large fire insurance losses. Astin Bulletin, 27(1):139-151.

[9] Dutta, K. and Perry, J. (2006). Evolution by gene duplication. A tale of tails: an empirical

analysis of loss distribution models for estimating operational risk capital.

[10] Adcock, C., Eling, M. and Loperfido, N. (2015). Skewed distributions in finance and actuarial science: a review. European Journal of Finance, 21:1253-1281.

[11] Bennett, S. (1983). Log-logistic regression models for survival data.Applied Statistics, 32(2):165.

[12] Lamberson, L. R. (1974). An evaluation and comparison of some tests for the validity of the assumption that the underlying distribution of life is exponential. AIIE Transactions, 6(4):327-337.

[13] LIN, B. S. DURAN, T. O. LEWIS (1989). Inverted gamma as a life distribution. Microelectronics Reliability, 29(4):619-626.

[14] V. K. SHARMA, S. K. SINGH, U. SINGH, V. AGIWAL (2015). The inverse Lindley distribution: A stress-strength reliability model with application to head and neck cancer data. Journal of Industrial and Production Engineering, 32(3): 162-173.

[15] Oguntunde, P. and Adejumo, O. (2014). The transmuted inverse exponential distribution. International Journal of Advanced Statistics and Probability, 3(1):1-7.

[16] Khan, M. S., King, R., and Hudson, I. (2014). Characterizations of the transmuted inverse Weibull distribution. ANZIAM Journal, 55:197-217.

[17] Sharma, V. K., Singh, S. K., and Singh, U. (2014). A new upside-down bathtub shaped hazard rate model for survival data analysis. Applied Mathematics and Computation, 239:242-253.

[18] Sharma, V. K., Singh, S. K., Singh, U., and Merovci, F. (2016). The generalized inverse Lindley distribution: A new inverse statistical model for the study of upside-down bathtub data. Communications in Statistics - Theory and Methods, 45(19):5709-5729.

[19] Modi, K., Kumar, D., and Singh, Y. (2020). A new Family of Distribution with Application on Two Real Datasets on Survival Problem. Science and Technology Asia, 25: 1-10.

[20] Glaser, R. E. (1980). Bathtub and related failure rate characterizations. Journal of the American Statistical Association, 75(371): 667-672.

[21] Galton, F. (1883). Inquiries into Human Faculty and its Development,Macmillan and Company, London. (25).

[22] Moors, J. J. A. (1988). A quantile alternative for kurtosis. Journal of the Royal Statistical Society, Series D , 37(1):25-32.

[23] Gupta, R. C., Gupta, P. L., and Gupta, R. D. (1998). Modeling failure time data by Lehmann alternatives. Communications in Statistics - Theory and Methods, 27(4):887-904.

[24] Klugman, S.A, Panjer, H.H, Willmot, G.E.(2012) Loss Models: From Data to Decisions, vol. 715, John Wiley and Sons, New York.

[25] Nair,J., Wierman, A.and Zwart,B.(2013). The fundamentals of heavy-tails: Properties, emergence, and identification. In ACM SIGMETRICS Performance Evaluation Review, 41:387-388.

[26] Mann, N., Schafer, R., and Singpurwalla, N. (1974). Methods for Statistical Analysis of Reliability and Life Data. Wiley.

[27] Abouammoh, A., Abdulghani, S., Qamber, I. (1994). On partial orderings and testing of new better than renewal used classes. Reliability Engineering and System Safety, 43:37-41.

[28] Efron, B. (1988). Logistic Regression, Survival Analysis, and the Kaplan-Meier Curve. Journal of the American Statistical Association, 83:414-425.

[29] Mudholkar, G. S., Srivastava, D. K., and Kollia, G. D. (1996). A Generalization of the Weibull Distribution with Application to the Analysis of Survival Data. Journal of the American Statistical Association, 91.

i Надоели баннеры? Вы всегда можете отключить рекламу.