Научная статья на тему 'Constructing Tolerance Limits On Order Statistics In Future Samples Coming From Location-Scale Distributions'

Constructing Tolerance Limits On Order Statistics In Future Samples Coming From Location-Scale Distributions Текст научной статьи по специальности «Математика»

CC BY
93
9
i Надоели баннеры? Вы всегда можете отключить рекламу.
Область наук
Ключевые слова
order statistics / F distribution / tolerance limits / location-scale distribution

Аннотация научной статьи по математике, автор научной работы — Nicholas A. Nechval, Konstantin N. Nechval, Vladimir F. Strelchonok

Although the concept of statistical tolerance limits has been well recognized for long time, surprisingly, it seems that their applications remain still limited. Analytic formulas for the tolerance limits are available in only simple cases. Thus it becomes necessary to use new or innovative approaches which will allow one to construct tolerance limits on future order statistics for many populations. In this paper, a new approach to constructing lower and upper tolerance limits on order statistics in future samples is proposed. Attention is restricted to location-scale distributions under parametric uncertainty. The approach used here emphasizes pivotal quantities relevant for obtaining tolerance factors and is applicable whenever the statistical problem is invariant under a group of transformations that acts transitively on the parameter space. It does not require the construction of any tables and is applicable whether the past data are complete or Type II censored. The proposed approach requires a quantile of the F distribution and is conceptually simple and easy to use. For illustration, the normal and log-normal distributions are considered. The discussion is restricted to one-sided tolerance limits. A practical example is given.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Текст научной работы на тему «Constructing Tolerance Limits On Order Statistics In Future Samples Coming From Location-Scale Distributions»

Constructing Tolerance Limits On Order Statistics In Future Samples Coming From Location-Scale

Distributions

Nicholas A. Nechval

Dept. of Mathematics, Baltic International Academy, Riga, Latvia e-mail: [email protected]

Konstantin N. Nechval

Dept. of Applied Mathematics, Transport and Telecommunication Institute, Riga, Latvia

e-mail: [email protected]

Vladimir F. Strelchonok

Dept. of Mathematics, Baltic International Academy, Riga, Latvia e-mail: [email protected]

Abstract

Although the concept of statistical tolerance limits has been well recognized for long time, surprisingly, it seems that their applications remain still limited. Analytic formulas for the tolerance limits are available in only simple cases. Thus it becomes necessary to use new or innovative approaches which will allow one to construct tolerance limits on future order statistics for many populations. In this paper, a new approach to constructing lower and upper tolerance limits on order statistics in future samples is proposed. Attention is restricted to location-scale distributions under parametric uncertainty. The approach used here emphasizes pivotal quantities relevant for obtaining tolerance factors and is applicable whenever the statistical problem is invariant under a group of transformations that acts transitively on the parameter space. It does not require the construction of any tables and is applicable whether the past data are complete or Type II censored. The proposed approach requires a quantile of the F distribution and is conceptually simple and easy to use. For illustration, the normal and log-normal distributions are considered. The discussion is restricted to one-sided tolerance limits. A practical example is given.

Keywords: order statistics, F distribution, tolerance limits, location-scale distribution

1. Introduction

Statistical tolerance limits are an important tool often utilized in areas such as engineering, manufacturing, and quality control for making statistical inference on an unknown population. As opposed to a confidence limit that provides information concerning an unknown population

parameter, a tolerance limit provides information on the entire population; to be specific, onesided tolerance limit is expected to capture a certain proportion or more of the population, with a given confidence level. For example, an upper tolerance limit for a univariate population is such that with a given confidence level, a specified proportion or more of the population will fall below the limit. A lower tolerance limit satisfies similar conditions. It is often desirable to have statistical tolerance limits available for the distributions used to describe time-to-failure data in reliability problems. For example, one might wish to know if at least a certain proportion, say f, of a manufactured product will operate at least T hours. This question can not usually be answered exactly, but it may be possible to determine a lower tolerance limit L(Xi, ..., Xn), based on a preliminary random sample (Xi, ..., Xn), such that one can say with a certain confidence y that at least 100f % of the product will operate longer than L(Xi, ..., Xn). Then reliability statements can be made based on L(Xi, ..., Xn), or, decisions can be reached by comparing L(Xi, ., Xn) to T. Tolerance limits of the type mentioned above are considered in this paper. That is, if fe(x) denotes the density function of the parent population under consideration and if S is any statistic obtained from the preliminary random sample (Xi, ., Xn) of that population, then L(S) is a lower y probability tolerance limit for proportion i if

Pr

J fe(x)dx > ß

L(S)

(1)

and U(S) is an upper y probability tolerance limit for proportion f if

U(S)

Pr

J fe(x)dx > ß

= 7,

(2)

where 0is the parameter (in general, vector).

The common distributions used in life testing problems are the normal, log-normal, exponential, Weibull, and gamma distributions [1]. Tolerance limits for the normal distribution have been considered in [2], [3], [4], and others.

Tolerance limits enjoy a fairly rich history in the literature and have a very important role in engineering and manufacturing applications. Patel [5] provides a review (which was fairly comprehensive at the time of publication) of tolerance limits for many distributions as well as a discussion of their relation with confidence intervals for percentiles and prediction intervals. Dunsmore [6] and Guenther, Patil, and Uppuluri [7] both discuss 2-parameter exponential tolerance intervals and the estimation procedure in greater detail. Engelhardt and Bain [8] discuss how to modify the formulas when dealing with type II censored data. Guenther [9] and Hahn and Meeker [10] discuss how one-sided tolerance limits can be used to obtain approximate two-sided tolerance intervals by applying Bonferroni's inequality. Tolerance limits on order statistics in future samples coming from a two-parameter exponential distribution have been considered in [11].

In contrast to other statistical limits commonly used for statistical inference, the tolerance limits (especially on order statistics) are used relatively rarely. One reason is that the theoretical concept and computational complexity of the tolerance limits is significantly more difficult than that of the standard confidence and prediction limits. Thus it becomes necessary to use new or innovative approaches which will allow one to construct tolerance limits on future order statistics for many populations.

In this paper, a new approach to constructing lower and upper tolerance limits on order statistics in future samples is proposed. For illustration, the normal and log-normal distributions

that are commonly used in reliability and risk theory are considered. Although the concept of statistical tolerance limits has been well recognized for long time, surprisingly, it seems that their applications remain still limited.

2. Mathematical Preliminaries 2.1. Probability Distribution Function of Order Statistic

Theorem 1. If there is a random sample of m ordered observations Yi<...<Ym from a known distribution (continuous or discrete) with density function fe (y), distribution function Fe (y), then the probability distribution function of the kth order statistic Yk, ke{1, 2, ..., m}, is given by

Pe (Yl ^ У1 ) = J f2(m-k+1),2k (x)dx

(3)

1-Fe (yk ) 2k

F (yk ) 2(m-k+1)

where

J2(m-k+l),2k

(x) =

1

В

2(m-fc + l) 2k

2 ' 2

2{m — k + 1)

2k

2(m-k + 1)

2k

2(m-i+l)/2-l

2(m — k + 1)

1 H—--1 x

2k

- [2(m-fc+l)+2fc]/2

X > 0,

(4)

is the probability density function of an F distribution with 2(m-k+1) and 2k degrees of freedom.

Proof. Suppose an event occurs with probability p per trial. It is well-known that the probability P of its occurring k or more times in m trials is termed a cumulative binomial probability, and is related to the incomplete beta function Ix(a, b) as follows:

j=k

p3( 1 — p)m 3 = I (k,m — к + 1).

It follows from (5) that

(5)

j=k

[FeivM - РМГ3 = IFM)(k, m-k + l)

FÀVt)

B(/c, m-k + l)

J uk-\l-u)

2(m - к + 1)

2k

2(m-fc+l)/2

рв(Ук) 2(m-k+l)+2k

В

2k 2{m — k + l)

2 ' 2

I

1

2

и

1 -u

2k

2(m-fc+l)/2-l

2(m — k + 1)

— 2k

2(m — k + 1)

du

FF

u

2 (m-k + 1) 2k

2(m-k+\)/2

2(m — k + 1) 2k 2 ' 2

B

■XJ /

X

2(m-fc+l)/2-l

Fa(yk) 2(m-k+l)

2 (m-k + 1)

1 H—--x

2k

- [2(m-k+l)+2k]/2

dx, (6)

where

1 -u 2k

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

x =--. (7)

u 2{m-k + 1) w

This ends the proof. Corollary 1.1.

1 -F„(vt) 2k

F0(yk) 2(ra-*+l)

P*(Yk >yk) = 1 - Pe{Yk <yk}= J f^k+1)»(x)dx. (8)

o

Corollary 1.2. If yk,m,ris the quantile of order y for the distribution of Yk, we have from (8) that yk,m;y is the solution of

Fo (V^n ) = k/[k + (m-k + l)g2(m_4+1)>M;1_T ], (9)

where #2(m-*+i)2*- x_r is the quantile of order 1-yfor the F distribution with 2(m-k+1) and 2k degrees of freedom.

2.2. Normal and Log-Normal Distributions

The normal and log-normal distributions are commonly used to model certain types of data that arise in several fields of engineering as, for example, different types of lifetime data (see, e.g., [12]). The goal of modeling certain types of data is to provide quantitative forecasts of various system performance measures such as service level, expected waiting time, agent's occupancy, schedule efficiency, cost etc. Evaluation of these performance measures is important to making optimal decisions about overall cost, system performance, which has to be within the allowable budget and other performance based constraints.

Particular properties of the log-normal random variable (as the non-negativeness and the skewness) and of the log-normal hazard function (which increases initially and then decreases) make log-normal distribution a suitable fit for some engineering data sets. The log-normal distribution is used to model the lives of units whose failure modes are of a fatigue-stress nature. Since this includes most, if not all, mechanical systems, the log-normal distribution can have widespread application. Consequently, the log-normal distribution is a good companion to the Weibull distribution when attempting to model these types of units. As may be surmised by the name, the log-normal distribution has certain similarities to the normal distribution. A random variable is log-normally distributed if the logarithm of the random variable is normally distributed. Because of this, there are many mathematical similarities between the two distributions. For example, the mathematical reasoning for the construction of the probability plotting scales and the bias of parameter estimators is very similar for these two distributions.

36

Nevertheless, the log-normal distribution differs from the normal distribution in several ways. A major difference is in its shape: where the normal distribution is symmetrical, a lognormal one is not. Because the values in a lognormal distribution are positive, they create a right skewed curve

(Figure 1).

foix)

X 0.9 n2-n h A —- a2=i \ I \ --a==2 25

O.S

0.7

O.G AX \

0.5 / k\ \

0-4 0.3 /v \ I1 f v X \ / \

0.2 /

0.1 fJ __

0

-O 1

c 1 2 3 4 5 0

Figure 1. Log-normal probability density functions with f=0 for selected values of a2.

The log-normal distribution has played major roles in diverse areas of science. Royston [13] modeled survival time in cancer with an emphasis on prognostic factors using the log-normal distribution. Log-normal distributions gave appropriate description of the overall service times and the service times of administrative, e-mail, miscellaneous and network jobs.

Finally, log-normal distributions are self-replicating under multiplication and division, i.e., products and quotients of log-normal random variables are themselves log-normal distributions (Crow and Shimizu [14]; Aitchison and Brown [15]), a result often exploited in back-of-the-envelope calculations.

A positive random variable X is said to be log-normally distributed with two parameters ju and a 2 if X = In X is normally distributed with mean ju and variance cr2. The two-parameter lognormal distribution is denoted by A((u,ct2); the corresponding normal distribution is denoted by N(ju,a2). The probability density function (pdf) of X having A(ju,a2) is

1

xcr

exp

[In x — /u\

2cF

x > 0, - oo <ju <oo, cr > 0,

(10)

where (h(ju,a2). The cumulative distribution function (cdf)) of X is given by

Fg (x) = Pr(Z < x) = ®

In X - jLl

(11)

It follows from (10) that

X~fe{x) = -

ayfzrr

exp

(x — fi)2

2a

— OO < X < oo,

(12)

that is, X = \nX ~ N(/j,<j2), where 0 = (ju, a2), -oo < ¡j, < oo is the location parameter and cr > 0 is the scale parameter. The cdf of the normal distribution is given by

1

1

(тЧ2п _

exp

(х — jSf

2ал

dx.

(13)

It is known (Nechval and Vasermanis [16]) that the complete sufficient statistic for the parametric vector e, based on observations in a random sample (Xi, ..., Xn) of size n from the normal distribution (13) is given by

S =

(14)

Here the following theorem takes place.

Theorem 2. Let (Xi, ..., Xn) be a preliminary random sample from the normal distribution (13) , where it is assumed that the parametric vector e = (^,ct2) is unknown. Then the joint probability density function of the pivotal quantities,

is given by

where

v = y/n( x -u) v (n - ад2

V1 , '2 = 2

a

a

f (v) = fl(vl)f2(v2), V = (Vl,V2),

f(v) = "i=exp

' vT

v 2 у

(15)

(16)

(17)

(18)

f2(v2)- 2C-1V»r((n-1)/2)v; 2 eXP( ^ v2 *0.

(19)

Proof. The joint density of X1, ..., Xn is given by

fe( X1, ..., x ) = Щ( x, ) = П (2яет2)-1/2 exp i--1^ x-m)2

,=i ,=i V 2a

= (2жа2уп/2 exp| -—£(x,-j)2 |.

(20)

Using the invariant embedding technique (Nechval et al. [17], [18], [19]), we transform (20) to

1

fe(Xj, ..., xn)dxds\ = (2a2) n/2 exp--- V (x - x + x - ju)

V 2a ,=1

dxds,

= (2яст2 )-n/2 exp I--- £ [( x,. - x )2 + 2 ( x - x )(x - j) + (x - uf ] | dxdsj2

2a ,=1

= (2oCT2)-n/2exp

2a

£(x - x)2 + 2(x - u)Vn(xi - x) + n(x - u)2

dxds,2

-œ<v<œ

2\ —n/2 _

= (2 fta ) exp

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

2a

J ( xt — x )2 + n( x — j)2

dxds2

= n ~1/2(2<T1/2exp

n(x — jU)

2\

d

sfn( x — u)

V a y

X (ft)—(n—1)/2(n —1)—(n—1)/2(sf)—( n—1)/2 a: (2ft) 1/2 exp

'(n — 1)s^(n—1 )/2—1

V 2a2 y

exp

(n — 1)S12 ^f (n — 1)si

V â y V 2a y

2^

2 f \ (n—1)/2— f \ / \

V1 dv1 f h I V2 exp 1 d I — 1

1 2 y V 2 y V 2 y V .2 y

(21)

Normalizing (21), we obtain (16). This ends the proof. Thus,

n(0,1), V ~ j„2_1,

(22)

where V2 is statistically independent of Vi.

Theorem 3. If Vi is a normally distributed random variable with unit variance and zero mean, and V2 is a chi-squared distributed random variable with n-1 degrees of freedom that is statistically independent of V1, then

t Vi +A Vi +A ™

T = . 1 = /n_ 1,A (t) — œ< t

\JV2 /(n — 1) VF

(23)

is a non-central t-distributed random variable with n-1 degrees of freedom and non-centrality parameter A, where

F S2 (n- lV"-1)/2

- ~(„-l)/2

n — 1 a

r((n —1)/2)

w

(n—1)/2—1

exp(—(n — 1)w /2), w > 0, (24)

/n—1, A (i) =

(n — 1)

(n—1)/2

exp

(n — 1)A2 2(t2 + n — 1)

(n —1)/2) 2n/2 (t2 + n — 1)"

X J w"/2—1 exp

w1/2 —

tA

t + n — 1

2

dw., — œ < t < œ,

(25)

is the probability density function of T,

F = F (t2 + n — 1),

(26)

F_hA (t ) = Pr(T < t ) ^ (^ r ^ |}/ 2) | exP("(n " ^ / - A)^ (27)

is the cumulative distribution function of T. ®(x) is the standard normal distribution function. Note that the non-centrality parameter A may be negative. Proof. It follows from (23) that

(n—1)/2 œ

0

Pr(T < 11 W = w) = Pr

V +A ■s/w

< t

w

Pr V < tyfw -A) = 0(t>/W -A)

(28)

Since it follows from (24) and (28) that

Pr(T < t) = E {Pr(T < 11 W)} = J 0(tyfw - A) f^ (w)dw,

(29)

we get the cumulative distribution function Fn-l A(t ) of the non-central ¿-distribution given in (27). It is easy to show that the probability density function of T defined in (25) is given by

This completes the proof.

f -,A (t) = F' (t).

3. Tolerance Limits on Order Statistic

(30)

3.1. Lower Tolerance Limit

Theorem 4. Let Xi, ..., Xn be observations from a preliminary sample of size n from a normal distribution defined by the probability density function (12). Then a lower one-sided p-content tolerance limit at a confidence level y, Lk =Lk(S) (on the kth order statistic Yk, ke{1, ..., m}, from a set of m future ordered observations Yi < ... < Ym also from the distribution (12) ), which satisfies

is given by

where

Pr P0(Yk>Lk)>ß =7,

L = X

Vl =-trA-r/J" '

(31)

(32)

(33)

is the lower tolerance factor, trAy is the quantile of order y for the non-central i-distribution with r=n-1 degrees of freedom and non-centrality parameter A = -z1-s^^fn, z\-sp denotes the 1-Sp quantile of the standard normal distribution,

Sri = (m — k + 1 )q,

2(m-k+\),2k]ß

m — k + 1 )q,

2(m-k+l) ,2kfl

■k],

(34)

q2(m-k+1),2k-p is the quantile of order p for the F distribution with 2(m-k+1) and 2k degrees of freedom.

Proof. It follows from (8), (13) and (31) that

Pr Pß(Y > Lk)> ß = Pr

l--^(^)_

F0(Lh) 2(m-k+1)

l ^

2(m—k+l),2k

(.x)dx > ß

0

0

= Pr

1 -FÀh) 2 k

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

F0(Lk) 2(m-fc + l)

2(m-k+l) ,2k-ß

= Pr

FM <

k

k + (m — k + 1 )q,

2{m-h+\),2h\ß

= Pr

crV2vr

I

exp

(p-vf

2a'

dy >

(■m-k + 1 )g

2(ro-fc+l),2fc;/J

(m - fc + l)92(m_,+1)i2i;/3 +

= Pr

'x; /

exp

tfc > <5a

= Pr

V2tt

<7 /

exp

2; 2

tte < 1 -

= Pr

Lk —U

< z,

= Pr

L — X + X — J

< z,

= Pr

= Pr

h-^rn^< z^ 1=Pr (^ 4~n4F+v < z^ v" ) S1 G a y

V — z1—¿„V«

VF

= Pr

V + A

VF

^l^Ï" = Pr (t < —^ v" ) = F ,A (t ), (35)

where

is the lower tolerance factor,

A = — zx_s *Jn, r = n — 1, t = —^Vn.

It follows from (31), (35) and (37) that the lower tolerance factor should be chosen such that

Fr,A (t ) = Fr,A = Fr,A (tr,A-r ) = ^

(36)

(37)

(38)

where is the quantile of order y for the non-central ¿-distribution with r degrees of freedom and non-centrality parameter A. It follows from (38) that

Vl = - K a J • (39)

It follows from (36) that Lk = X + vl S • This completes the proof.

Corollary 4.1. It follows from (35) that Pr(riL4nJW + V <z1-Sfijn) can be transformed as follows:

Pr (iiL4n-W+v < ^ 4n)=Pr ( V < -vLJn-Jw+^ vn)

+z1—Sß4n

J /1 (v )dv = 0(—^ V^VF + Z1—^ 4n) = 0(tVF — A)

(40)

where

1

L

t = -VL^n » A = - V".

Then it follows from (31) and (40) that i has to be found such that

(41)

t = arg IE {o(t>/W - A)} = r) = arg J 0(t>/w - A)f (w)dw = r

V o

= arg

r/2 œ

/2i, .., Jwr/2-1 exp(-rw/2)0(^Vw-A) dw = r = arg(^(t) = r) = (42)

2 r(r /2) 0 )

where ^ is the quantile of order y for the non-central i-distribution with r=w-1 degrees of freedom and non-centrality parameter A,

F .(t) = Pr(T<t) = —— r,A() ( ) 2r/2r(r /2

--- \wr/2 1 exp(-rw /

(r /2)Jo P(

'2)0(t>/w -A)dw.

(43)

is the cumulative distribution function of T,

f» (t) = F a (t ) =

rr 2 exp(-rA2 / [2(t2 + r)])

J wfr-1)/2 exp

x J w.

0

V^r( r /2) 2("+1)/2(t2 + r) 2

O+1)/2

wi/2 ^ tA

' J

t2 + r

dw,, - œ < t < œ,

is the probability density function of T, where

W. = W (t2 + r).

Corollary 4.2. If

W„ = W (t2 + r)/2,

(44)

(45)

(46)

then

f, (t )=f;. a (t ) = ^r: W'», J w:r-1|/2 exp

r /2) (t2 + r )'

rr/2 exp(-A2 / 2) ^ T((r + j +1) / 2)

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

w„ -

t

t2 + r

dw_.

yfnr(r /2) (t2 + r)'

(r+1)/2

s

j=0

j !

/aV

VT"

v

t2 + r

- œ < t < œ.

(47)

This form of the density function is derived in Rao [20] and appears in Searle [21]. In both Rao and Searle, -Jn is incorrectly omitted from the denominator. It should also be noted that the central i-distribution is just a special case of the non-central i with A = 0.

Corollary 4.3. If k=m=1, then

Sp=p, A = - Zi-pV«. (48)

Corollary 4.4. Let Xl < ... < Xn be ordered observations from a preliminary sample of size n from a log-normal distribution defined by the probability density function (10). Then a lower onesided /^-content tolerance limit at confidence level y, Lk = Lk(S) (on the fcth order statistic },.

ke¡1, ..., m}, from a set of m future ordered observations 7, < ... < Yk also from the distribution (10) ), which satisfies

is given by

where

Pr Pe(Y. >Lk)>ß = 7,

^ = exP ) = exp (JT + ^^),

(49)

(50)

X,=lnl„ Z'e{l, ...,«}, X = ^X./n, Sf^iX.-X)2/(n-1),

i=1 i=1

A = -zx_Sß -Jiu 6ß = (m — k + l)q2im_k+wl[{m -k +1 )q2{m_k+ll2k,i + k]

tr,A;r= arg [Fr,A (t) =r] , r = n-1, Vl =-WA^' 3.2. Upper Tolerance Limit

(51)

Theorem 5. Let Xi, ..., Xn be observations from a preliminary sample of size n from a normal distribution defined by the probability density function (12). Then an upper one-sided ^-content tolerance limit at a confidence level y, Uk=Uk (S) (on the kth order statistic Yk from a set of m future ordered observations Yi<...<Ym also from the distribution (12)), which satisfies

is given by

where

Pr PgiY < U ) > ß =7,

Uk = X + VuSl,

Vu = tr ,A;1-V^ '

(52)

(53)

(54)

is the upper tolerance factor, A1_ is the quantile of order 1-y for the non-central ¿-distribution with r=n-1 degrees of freedom and non-centrality parameter A = -z1_^-s/n, z1-di denotes the 1-81-p quantile of the standard normal distribution,

6l _d = (m — k + 1 )q

2(m-k+l),2k-,l-ß

m — k + 1 )q,

2(m-k+X),2k

(55)

q2(m-k+1),2k.1-^ is the quantile of order 1-p for the F distribution with 2(m-k+1) and 2k degrees of freedom.

Proof. It follows from (3), (13) and (52) that

Pr P0(Yk <Uk)>ß = Pr

= Pr

1 -F„(Ut) 2k

F0(Uh) 2(m-k+l)

f f2{m-Ml2MdX<1-ß

= Pr

1 -Fe(Uk) 2 k

Fe{Uk) 2(m-fc + l)

<9.

2(m-k+l),2k]l-ß

= Pr

w >

k + (m — k +

= Pr

CT^TT

/

exp

2 a2

2(m-fc+l),2fc;l-i3

(m — k + 1)5,

+ k

= Pr

/

exp

2 2

= Pr

ut-i<

G

I

exp

dz>l-&

= Pr

> z,

1

= Pr

U -X + X-M

> z,

1

= Pr

U - X V 5

^ + ^(X-ß) > ^ ^1 = Pr+ Fi >

n n 11 V >

= Pr

- -Z1 ^ -Jw

n n

\

>-riv4n

= Pr| > | = Pr(r > -ru>/n) = 1 -^,A(t), (56)

where

is the upper tolerance factor,

Vu=(Uk-X)/S1,

A = -Zj_^ , r = n -1, t = -rvy[n.

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

(57)

(58)

It follows from (49), (56) and (58) that the upper tolerance factor Vu should be chosen such that

F,a (t) = Fr,A (-%>/") = Fr,A (ty,A;1-y ) = 1 - y, (59)

where is the quantile of order 1 -yfor the non-central ¿-distribution with r degrees of freedom and non-centrality parameter A. It follows from (59) that

Vu = -tr ,A ;1- J "J» • (60)

It follows from (57) that Uk = X + >]l:S]. This completes the proof.

Corollary 5.1. Let Xx, ... < Xn be observations from a preliminary sample of size n from a lognormal distribution defined by the probability density function (10). Then an upper one-sided fi-content tolerance limit at confidence level y, Uk = Uk(S) (on the fcth order statistic },. ke ¡1, ..., m}, from a set of m future ordered observations ),<...< Yk also from the distribution (10) ), which satisfies

u

k

1

u

is given by

Pr Pß(Yk<Ük)>ß =7, (61)

Ük = exp (Uk) = exp (2+77^), (62)

where

n n

X,=inl„ z-e{i,...,«}, x = "£x./n, s;2=^r(x-x)2/(n-i),

i=1 i=1

A = -zx_Sif 47i, ={m-k + 1 )q2{m_k+lh2k.x_,/[(m -k + 1 )q2^_k+i)^_i3 + k],

Wy = arg [Fr,A (t) = 1 -y]- r = n-1, Vu =-t^l-y/V«- (63)

Remark 1. It will be noted that an upper tolerance limit may be obtained from a lower tolerance limit by replacing p by 1-p, y by 1-y.

4. Practical Example

A manufacturer of semiconductor lasers has the data on lifetimes (in terms of hours) obtained from testing n=10 semiconductor lasers. These data are given in Table 1.

Table 1. The data on lifetimes obtained from testing n=10 semiconductor lasers

Observations (in terms of hours)

X^ X^ x^ x ^ x^ x ^ x^ x ^ x^ x ^ 18657 18960 19771 21015 21183 21960 22881 24642 25373 27373

A buyer tells the laser manufacturer that he wants to place two orders for the same type of semiconductor lasers to be shipped to two different destinations. The buyer wants to select a random sample of m=5 semiconductor lasers from each shipment to be tested. An order is accepted only if all of 5 semiconductor lasers in each selected sample meet the warranty lifetime (in terms of hours). What warranty lifetime (in terms of hours) should the manufacturer offer so that all of 5 semiconductor lasers in each selected sample meet the warranty with probability of 0.95?

In order to find this warranty lifetime, the manufacturer wishes to use a random sample of size n=10 given in Table 1 and to calculate the lower one-sided simultaneous tolerance limit Lk=1(S) (warranty lifetime) which is expected to capture a certain proportion, say, p=0.95 or more of the population of selected items (m=5), with the given confidence level y=0.95. This tolerance limit is such that one can say with a certain confidence y that at least 100p % of the semiconductor lasers in each sample selected by the buyer for testing will operate longer than L1(S).

Goodness-of-fit testing. It is assumed that the data of Table 1 follow the log-normal probability distribution (10), where the parameters ¡i and a are unknown. Thus, for the above example, we have that n =10, m =5, k = 1, p= 0.95, y= 0.95,

S =

X = / n = 10, Si = - Xf / (n - 1) = 0.016302

(64)

We assess the statistical significance of departures from the model (10) by performing the Anderson-Darling goodness-of-fit test. The Anderson-Darling test statistic value is determined by

A2 = —

£ (2t — 1) (ln Fe( x, ) + ln (1 — Fe( xn+1_ t ) ))

in — n,

(65)

where Fg( ) is the cumulative distribution function of X = lnX,

0 = (u = x ,g = ^ ),

(66)

n is the number of observations.

The result from (65) needs to be modified for small sampling values. For the normal distribution the modification of A2 is

Aod = A2(1 + 0.75/ n + 2.25/ n2).

(67)

The ALj value must then be compared with critical values, A2, which depend on the significance level a and the distribution type. As an example, for the normal distribution the determined value has to be less than the following critical values for acceptance of goodness-of-fit (see Table 2):

Table 2. Critical values for ALd

a

a2

0.1 0.631

0.05 0.752

0.025 0.873

0.01 1.035

For this example, a=0.05, A2=005 = 0.752,

A2 = —

£(2t —1) (ln Fo( x, ) + ln (1 — Fo( xn+1—t )))

/10 — 10 = 0.193174,

ALd = A2(1 + 0.75/10 + 2.25/102) = 0.212 < A2=005 = 0.752.

(68)

(69)

Thus, there is not evidence to rule out the log-normal model (10).

Finding lower tolerance limit (warranty lifetime for semiconductor laser). Now the lower one-sided simultaneous ^-content tolerance limit at the confidence level y, Li = Li (S) (on the order statistic Yi from a set of m = 5 future ordered observations Yi < ... <Ym ) can be obtained from (50). Since m=5, k=1, ^=0.95, it follows from (51) that:

8fi = (m — k + 1 )q,

2(m-/fc+l),2/fc;;3

m — k + 1 )q,

2(m-k+X),2k

a +k] = 0.989796,

r = n — 1 = 9, A = —^>/«=7.3325, r=0.95,

(70)

(71)

the quantile of order y for the non-central t-distribution with r degrees of freedom and non-centrality parameter A is given by

trAr= arg ( Fr,A (t ) = r) = 12.5512, the lower tolerance factor is given by

nL = — tr, Ar/ ^ = —3.969.

(72)

(73)

Now it follows from (50), (64) and (73) that

L=1 = exp ( X + ) = 13270.

(74)

Statistical inference. Thus, the manufacturer has 95% assurance that at least 100^ % of the semiconductor lasers in each sample (m=5) selected by the buyer for testing will operate (in terms of hours) no less than Li=13270 hours.

This paper introduces a methodology to construct the one-sided tolerance limits on order statistics in future samples coming from location-scale distributions under parametric uncertainty. For illustration, the normal and log-normal distributions are considered. These distributions play a vital role in many applied problems of biology, economics, engineering, financial risk management, genetics, hydrology, mechanics, medicine, number theory, statistics, physics, psychology, reliability, etc., and have been extensively studied, both from theoretical and applications point of view, by many researchers, since its inception.

It will be noted that the theoretical concept and computational complexity of the tolerance limits is significantly more difficult than that of the standard confidence and prediction limits. Thus it becomes necessary to use new or innovative approaches which will allow one to construct tolerance limits on future order statistics for many populations. The concept proposed in this paper can be extended to two-sided tolerance limits too.

References

[1] Mendenhall, V. (1958). A bibliography on life testing and related topics. Biometrika, XLV: 521-543.

[2] Guttman, I. (1957). On the power of optimum tolerance regions when sampling from normal distributions. Annals of Mathematical Statistics, XXVIII: 773-778.

[3] Wald, A. and Wolfowitz, J. (1946). Tolerance limits for a normal distribution. Annals of Mathematical Statistics, XVII: 208-215.

[4] Wallis, W. A. (1951). Tolerance intervals for linear regression. In: Second Berkeley Symposium on Mathematical Statistics and Probability. Berkeley: University of California Press, pp. 43-51.

[5] Patel, J. K. (1986). Tolerance limits: a review. Communications in Statistics: Theory and Methodology, 15: 2719-2762.

[6] Dunsmore, I. R. (1978). Some approximations for tolerance factors for the two parameter exponential distribution. Technometrics, 20: 317-318.

[7] Guenther, W. C., Patil, S. A., and Uppuluri, V. R. R. (1976). One-sided p-content tolerance factors for the two parameter exponential distribution. Technometrics, 18: 333-340.

[8] Engelhardt, M. and Bain, L. J. (1978). Tolerance limits and confidence limits on reliability for the two-parameter exponential distribution. Technometrics, 20: 37-39.

[9] Guenther, W. C. (1972). Tolerance intervals for univariate distributions. Naval Research Logistics Quarterly, 19: 309-333.

[10] Hahn, G. J. and Meeker, W. Q. Statistical Intervals: A Guide for Practitioners, John Wiley,

1991.

[11] Nechval, N. A. and Nechval, K. N. (2016). Tolerance limits on order statistics in future samples coming from the two-parameter exponential distribution. American Journal of Theoretical and Applied Statistics, 5: 1-6.

[12] Meeker, W.Q. and Escobar, L. A. Statistical Methods for Reliability Data, John Wiley,

5. Conclusion

1998.

[13] Royston, P. (2001). The log-normal distribution as a model for survival time in cancer, with an emphasis on prognostic factors. Statistica Neerlandica, 55: 89-104.

[14] Crow, E.L. and Shimizu, K. Lognormal Distributions, Theory and Applications, Marcel Dekker, 1998.

[15] Aitchison, J. and Brown, J.A.C. The Lognormal Distribution, Cambridge University Press,

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

1957.

[16] Nechval, N. A. and Vasermanis, E. K. Improved Decisions in Statistics, Izglitibas Soli,

2004.

[17] Nechval, N. A., Nechval, K. N., and Vasermanis, E. K. (2003). Effective state estimation of stochastic systems. Kybernetes (An International Journal of Systems & Cybernetics), 32: 666-678.

[18] Nechval, N. A., Berzins, G., Purgailis, M., and Nechval, K. N (2008). Improved estimation of state of stochastic systems via invariant embedding technique. WSEAS Transactions on Mathematics, 7: 141-159.

[19] Nechval, N. A., Purgailis, M., Nechval, K. N., and Strelchonok, V. F. (2012). Optimal predictive inferences for future order statistics via a specific loss function. IAENG International Journal of Applied Mathematics, 42: 40-51.

[20] Rao, C. R. Linear Statistical Inference and its Applications, Wiley, 1965.

[21] Searle, S. R. Linear Models, Wiley, 1971.

i Надоели баннеры? Вы всегда можете отключить рекламу.