Научная статья на тему 'Comparison methods of modeling continuous random variables on empirical distributions'

Comparison methods of modeling continuous random variables on empirical distributions Текст научной статьи по специальности «Математика»

CC BY
73
30
i Надоели баннеры? Вы всегда можете отключить рекламу.
Область наук

Аннотация научной статьи по математике, автор научной работы — Farhadzadeh E. M, Farzaliyev Y. Z., Muradaliyev A. Z.

The new method of modeling of continuous random variables on empirical distributions offered. It shown, that discrepancy of accuracy of methods to shown requirements is shown at small number of realizations of random variables, reduced to not casual divergence of estimations of averages and average quadratic values empirical given and modeled samples.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Текст научной работы на тему «Comparison methods of modeling continuous random variables on empirical distributions»

Farhadzadeh E.M., Farzaliyev Y.Z., Muradaliyev A.Z. - COMPARISON METHODS OF MODELING CONTINUOUS RANDOM VARIABLES ON EMPIRICAL RT&A # 02 (29)

DISTRIBUTIONS (Vol.8) 2013, June

COMPARISON METHODS OF MODELING CONTINUOUS RANDOM VARIABLES ON

EMPIRICAL DISTRIBUTIONS

Farhadzadeh E.M., Farzaliyev Y.Z., Muradaliyev A.Z.

Azerbaijan Scientific-Research and Design-Prospecting Institute of Energetic AZ1012, Ave. H.Zardabi-94, e-mail :fem1939@rambler. ru

ABSTRACT

The new method of modeling of continuous random variables on empirical distributions offered. It shown, that discrepancy of accuracy of methods to shown requirements is shown at small number of realizations of random variables, reduced to not casual divergence of estimations of averages and average quadratic values empirical given and modeled samples.

STATEMENT OF THE PROBLEM.

One of the basic stages of imitating modeling is formation of random variables and casual events with the set law of distribution. In conditions of electro power systems (EPS) examples of random variables are: duration of emergency repair of the equipment and devices, intervals of time between non-working conditions of power units, the maintenance of soluble gases in transformer oil, etc. Casual events: short circuits on transmission lines, refusal in switching-off of the switch, false work of relay protection or automatics, etc. the Analytical form of laws of distribution here is in most cases unknown. Laws of change of a continuous random variable set by statistical (empirical) function of distribution (s.f.d.), and a discrete random variable - proceeding from those or other assumptions of probability of occurrence of casual event. This feature brings the certain interrelation between number of intervals s.f.d. F*(X), and number of intervals m at discrete representation of continuous empirical function of distribution F(X). If for F(X) the number of intervals m gets out equal (10^20), for F*(X) m=n.

Objectivity of imitating modeling in many respects depends on that, how much realizations of modeled random variables (events) will appear casual and will reflect the set laws of distribution. It is necessary to note also, that in practice often aspire to present set of statistical data one of known laws of distribution. Actually, the law of distribution of the statistical data concerning a class multivariate represents an uncertain composition of many distributions. In other words, difficulties of representation observable s.f.d. objective analytical law in many respects increase.

Methods of statistical modeling. By development of these methods, the greatest attention given a condition when the type of function of distribution of continuous random variable X known. Statistical modeling on empirical distribution is carried out by two methods. According [1] s.f.d. represented the following equations:

f;CX)=

0 еслиХ = X0 = 0

(3)

I + (XеслиХ < X < X (1)

n n(X1+1 - Xi)

1 еслиХ >Хт

where i=0,(n-1)

If to designate realization of a random variable with uniform distribution in an interval [0,1] through that according to (1) calculation corresponding £ realizations of random variable X it is carried out under the formula:

X = Xi + (Xi+! -Xi)• (£• n-i) (2)

where i=0,(n-1)

Intuitively clearly, that if the divergence (Xn-X1) is commensurable with Х1, modeling s.f.d. Fj** (X) under the formula (2) leads to regular distinction Fj* (X) and Fj** (X). This distinction shown in following parities of averages (accordingly M*(X) and M**(X)) and average quadratic (accordingly G*(X) and G**(X)) values of random variable X:

M*(X) > M**(X)

G*(X) < G**(X) Graphic illustration of this method is resulted on fig.1a. In the second method [2] s.f.d. represented the following equation:

0 если X < X1

F2* (X) J-^i +-- если Х1 < X < Xn (4)

I n -1 (n -1) • (Xi+1 - Xi) 1 n

1 еслиХ > Xn Calculation of realization of random variable X spent under the formula:

X = Xi + (Xi+1 - Xi) • [£• (n - 1) - (i - 1)] (5)

where i=1,(n-1)

In [2] it is marked, that obvious lack of this method is modeling random variable X in interval X1<X<Xn, in other words, size X never can be less Х1 and more Xn, that the brings the

certain error of an estimation M2*(X). Graphic illustration F2*(X) and components of the formula (5) is resulted on fig.1B.

Features of calculation under formulas (2) and (5) have caused expediency of specification of these methods of modeling. S.f.d. recommends presenting the following the equation [4]:

0 если X < X1

F3* (X) = | A-i +-- если Х1 < X < Xn+1 (6)

I n + 1 (n + 1) • (Xi+1 - Xi) 1 n+1

1 если Х > Хп+2

where i=1,(n+1)

Thus, calculation of realization of random variable X carried out under the formula: X = Xi + (Xi+1 - Xi) • [£• (n + 1) - (i - 1)] (7)

where i=1,(n+1)

The graphic illustration of components F3 (X) is resulted on fig.^

Fig. 1. Illustration methods of modeling continuous random variables on empirical distribution. a -method [1]; b - method [2]; c - method of authors

Algorithm of comparison of methods of statistical modeling. The basic requirement shown to

**

methods of statistical modeling, accuracy of conformity of distribution Fj (X) to initial distribution F*(X), where j=1,3. Most simple way of the control of a degree of such conformity at small values n is comparison of estimations of average values ME(X) and M* (X), and also average quadratic

values G*j (X) and G* (X) .

The block scheme of modeling algorithm is resulted on fug.2. 1 8

Fig 2. The integrated block diagram of algorithm of comparison of methods of modeling of continuous random numbers

Let's consider features of this algorithm by way of numbering blocks of its block diagram

(fig2.)

1. Initial data are:

— set of pseudo-random numbers with uniform distribution in an interval [0,1];

— n - number of random numbers £ in sample from {£}. Change n allows to establish its influence on result of comparison of methods of modeling of random variables X;

— a - significance value. Allows to estimate influence of a degree of conformity s.f.d. F*(£) to the uniform law on result of comparison of methods of modeling of random variables X;

— N - number of imitations of modeled sample {X}n

2. Under program RAND, (£) is formed n pseudo-random numbers corresponding the uniform law of distribution in an interval [0,1];

3. Considering, that X=£, average M*(X) and an average quadratic G*(X) values on sample is calculated

4. Under formulas (1), (4) both of (6) and sample {£}n are formed s. f.d. F (X), F2*(X) and F3*(X);

5. Sample from n pseudo-random numbers with an opportunity of the control of conformity of distribution F*(£) is formed to the uniform law with the set significance value a. The method of the general random numbers forms three samples from n random numbers X on distributions F*(X), F2*(X) and F3*(X). Calculations are spent for significance values (a errors I type) a, and number of realizations of sample N=1000;

6. Estimations of an average M*v(X) and average quadratic G*v(X) values of modeled random

variables on i- th to a method are calculated for y- th samples with i=1,3 and y=1,N;

7. Are formed s.f.d. Fi*[m*(X)J and Fi*[g*(X) for each of three methods i=1,3;

8. Comparison of methods is carried out by comparison M*(X) and G*(X) with similar parameters of distributions Fi*[M*(X)J and Fi* [g*(X)J, i.e. with M**(X) = M*[M*,T (X)] and

G**(X) = M*[G*T (X)] i= 1,3. Advantage is given a method for which the deviation from

M*E (X) and G*e (X) is minimal

RESULTS OF CALCULATIONS

It is established:

1. Influence of a method of modeling on accuracy of reproduction of distribution F*(£) it is shown only for small n. Already at n>20 divergence between M*(X) and M**(X), as well as G*(X) and G**(X) with i=1,3 does not exceed 1%. Notice, that at n=4 the divergence between M*(X) and M**(X) makes 12%, and between G*(X) and G**(X) makes 28.5%;

2. The size of a divergence including the greatest, between F*(Xj) and Fi**(XJ) (designate this size as Stj) does not depend on law of change F*(Xj) and Fi**(XJ), and depends on random

variables of sample {£}n, their numbers n and a way of modeling i= 1,3. As an example on fig.3 the graphic illustration of independence Stj with j=1,n from type F*(X) is resulted. In it finds reflection known nonparametric character of criterion of the greatest divergence [3] 3. Comparison of methods of modeling shows, that

M*(X) = M2*(X) = M3*(X) » M**(X) M*[G2(X)] « G*(X) = M*[G3(X)] « M*[G*(X)]

In other words, the first method does not meet shown requirements to accuracy of calculation both on size M*(X), and on value G*(X). Average values M2*(X) and M3*(X),

calculated at modeling samples random variables, accordingly, the second (i=2) and the third (i=3) methods, are practically indiscernible and equal to M*(X). However, average quadratic values modeled samples for the second method of modeling {X}n essentially differ from a reference value G*(X) while the size M*[G3(X)] practically does not differ from G*(X).

Graphic illustration of distinction s.f.d. R*[M*(X)] = 1 - F* [m*(X)] and

Fig.4. Illustration of distinction s.f.d. averages (a) and averages quadratic (b) values of realizations выборок modeled i=1,2 и3 methods

4. With increase a:

- Average value M*[M*(X)] with i=1,3 i.e. for each method of modeling aspires to the true value and allows to compare with methods more full. At a=0.8 following values are received: M*[M*(X)] = 0.629, M2[M2(X)] = 0.751 and M3[M3(X)] = 0.738 at M*(X) = 0.739;

- The disorder of realizations M*(X) with i= 1,3 decreases. If for a=0 for M*(X) it made G*[M*(X)] = 0.143, at a=0.8 size G*[M*(X)] = 0.066, i.e. disorder of realization M*(X) decreases in 2,2 times. The same reduction of disorder observed for the second and third methods;

- Distinction value of realizations G*(X), on the average, practically invariable also does not exceed 10% for n=4 and 3% for n=16. In the illustrative purposes on fig. 5 distributions F*[M*(X)]

and F*[G*(X)] are resulted at a=0.8, parities of considered methods confirming independence from a

- Consequences from increase a are similar to consequences of artificial increase in number of modeled random variables n on size (1-a)"1.

1,0 o,6 0,6 0.H 0,2. 0

1,0 0,8 0,6 0,H

0,1

M*(X)

<5\)

0,( 0,1 0,i D,4 0,5 0,6 0,1 0,8 0,9 1,0

Fig.5. Graphic illustration s.f.d. F*[M*(X)] and F*[G*(X)] ata=0.8

0,1 0,1 0.Ï 0,4 0,5 qt 0,1 0,4 0,5 1,0

CONCLUSION

The lead complex analysis has allowed establishing:

1. Discrepancy of accuracy of methods of modeling of continuous random variables on empirical distributions to shown requirements is shown only at small number of realizations of sample of random variables (n<20)

2. Comparison of methods of modeling can be lead by comparison of modeled estimations of averages and average quadratic values of random variables to empirical values of estimations of these parameters

3. Modeling of continuous random variables on the empirical distribution calculated under the formula (1), at small n leads to essential distinction of averages and average quadratic values of random variables of sample from empirical values, and under the formula (4) - average quadratic values of sample

4. Increase in a significance value a conformity of sample from n pseudo-random numbers to the uniform law on the consequences to similarly artificial increase in number n on size (1-a)-1

5. Statistical modeling of random variables on empirical distributions is expedient for spending under the formula (7).

REFERENCE

1. Pole Y.G. Likelihood modeling on electronic computers.-M.: Sov. radio, 1971, 400c.

2. Averill Law, V. Devid's, Kelton M. Imitating modeling. Classics CS,. - SPb.: Peter; Kiev: Publishing group BHV, 2004 847 p.

3. Gnedenko B.V., Beljaev J.K.,Solovyov A.D. Mathematical nightingales methods in the theory of reliability. "Science", 1965, 524 p.

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

4. Farhadzadeh E.M., Muradaliev A.Z., Rafiyeva N.K., Nazirova U.K. Method of statistical modeling of random variables on empirical distributions. Kazan: News of High schools. Problems of Power №9-10, 2008, 112-120 p

i Надоели баннеры? Вы всегда можете отключить рекламу.