Научная статья на тему 'Research and Development Solution Methodology of Informative Features Choosing for the RES Life Cycle Processes Analysis'

Research and Development Solution Methodology of Informative Features Choosing for the RES Life Cycle Processes Analysis Текст научной статьи по специальности «Математика»

CC BY
74
30
i Надоели баннеры? Вы всегда можете отключить рекламу.

Аннотация научной статьи по математике, автор научной работы — Nevlyudov I.Sh., Andrusevich Anatoly Alexandrovich, Khatnyuk Igor Stanislavovich

The methods of solving the problem of choosing the informative features to identify the RES functioning are offered by classifying the RES and life cycle processes in the feature space, and each of which has a value that allowed us to find a comprehensive criterion and formalize the selection process.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Текст научной работы на тему «Research and Development Solution Methodology of Informative Features Choosing for the RES Life Cycle Processes Analysis»

Research and Development Solution Methodology of Informative Features Choosing for the RES Life

Cycle Processes Analysis

Nevlyudov I.Sh., Andrusevich А.А., Khatnyuk I.S.

Abstract — The methods of solving the problem of choosing the informative features to identify the RES functioning are offered by classifying the RES and life cycle processes in the feature space, and each of which has a value that allowed us to find a comprehensive criterion and formalize the selection process.

I. Introduction

The functional problem of selecting informative features for monitoring the life cycle of radio electronic systems (RESs LC) can be solved within the framework of development methodology for dictionary attributes in the classification systems and the objects state identification [1-5]. In the working vocabulary one should use only those features that, on the one hand, are the most informative and that, on the other hand, may be available for measurement.

The description of the dictionary of features under conditions of constraints on the cost of observation hardware creation has some particulars. If the objects attributes are denoted by 5j, j=1, 2,..., N, each object in the N-dimensional space of attributes can be represented as a vector х = (хі,х2,...,хn) , its coordinates characterize the properties of objects.

To determine the measure of closeness or similarity between objects in N-dimensional attributes vector space a metric is introduced. The Euclidean metric can be used as

N.

d2(wpk,Wql) = £ (х pk - х qlj)2, (1)

j=1

p, q = 1,2,...,m; k = 1,2,...,kp; l = 1,2,..., kq, where p, q = 1, 2,..., m; k = 1, 2, ...,kp; l = 1,2,..., kq,

х pk are the values of j-feature, k is an object, p is a class, i.e. the q is an object class, i.e. object wql.

Manuscript received December 8, 2011.

Nevlyudov I.Sh. is with Kharkov national University of Radioelectronics (14 Lenin Avenue, Kharkov 61166, Ukraine, tel. (057) 702-14-86, Email: tapr@khture.kharkov.ua).

Andrusevich Anatoly Alexandrovich is with Aviation Krivoy Rog College (1 Tupolev St., Krivoy Rog 50045, Ukraine, tel. (0564) 27-56-79, Email: uchebotdel@kk.nau.edu.ua).

Khatnyuk Igor Stanislavovich is with Kharkov National University of Radioelectronics (14 Lenin Avenue, Kharkov 61166, Ukraine, tel. (057) 702-14-86, E-mail: tapr@khture.kharkov.ua)

As a measure of proximity between the objects of this class Qp, p = 1, 2, ..., m, we will use the value

S(Q p) =

1 kp kp

kp kp-1 k=1 l=1

£ £d (wpk,wpl)

2

(2)

that makes sense as class rms dispersion or the rms scatter of objects within class Ц,, as a measure of proximity between a given pair of objects of classes Qp and Qq, p, q = 1,..., m, is the value

R(Q p, Q q) =

kp kq

1 r , 2

££d2(wpk,wql)

kp kq k=1l=1

(3)

that makes sense of rms dispersion of a class of objects Qp and Qq.

A set of objects features that are used in the working vocabulary can be described as an N-dimensional vector A = (a1, a 2,..., a n), whose components takes the values 1 or 0 depending on whether there is or is not a possibility to determine the appropriate object feature.

Taking into account the a is the squared distance between two objects wpk and wql the unequation given below will take the form

N

d2(wpk, wql) = £a.(х(j)pk -х(j)ql)2 . (4)

j=1

Consequently, the rms scatter of the class Qp and objects of classes Ц, and Qq can be written as

S(Q p) = .

2 1 kp kp N

7-j—7 £££a.(х(j)pk - х(j)pl) kp kp - 1k=1l=1j=1

(5)

2

R(Q p, Qq) =.

kp k.

1 KrqN ... ... 0

- £££a.(х(j)pk -х(j)pl)2 . (6)

kp kq k=1l=1j=1

1

One can take into consideration that the cost of using the feature is proportional to the information contained in them, i.e. to such quantities of objects features that can be determined with their help. This assumption is fairly general. Thus, the cost of using the features amount to

N

C = C(ab...,an) = £C. a., (7)

j=1

where Cj is costs to determine the j-sign.

R&I, 2012, №1

17

As an indicator of quality or effectiveness of the designed recognition system we consider the functional, which depends in general on the function S(Qp), R(Qp,Qp) and

decision rule L(w, {wg})

I = F [S(Qp); R(Qp, Qq); L(w,{wg })]. (8)

Let the value L(w,{wg}) be a measure of proximity between the recognizable object w and class Qg, g = 1, 2, ..., m, that is given by its objects {wg}. As

this measure of proximity let’s consider the value

L(w,{wg})

II

1 kg

— 2 d2(w,wg) kg g=i

(9)

which is mean square distance between the object w and the classes objects Qp.

The decision rule consists in the following

w eQg, если L(w,{wg}) = extr L(w,{wJ) . (10)

It is important to note that the decrease in S(Qp), "compression" of objects belonging to a given class, while increasing R(Qp, Qq), i.e. "division" of objects belonging to

different classes improve the quality of the recognition system. Therefore, improving the efficiency of the system will be linked to the achievement of functional extremum I.

II. Formulation of research problems

Formulation of research problems can be represented as follows.

Let the entire set of objects be divided into classes Qi, i = 1,..., m, a priori all classes being described in the

language of features х ■, j = 1, ..., N, and funds, whose

value is equal to c0, are allocated for creating observation hardware. It is required that a working vocabulary of features be built which provides the maximum possible efficiency of the system without exceeding the allocated funds.

Thus, the problem reduces to finding the condition extremum of the functional which looks like (8), i.e. to the definition of А which implements the

extr I = extrF [S(Qp); R(Qp,Qq); L(w,{wg})]

a a

N

C =2Cjaj <Co. (11)

j=1

Consider some special types of the functional (11). If the required efficiency of the recognition system can be achieved through a more compact arrangement of objects in each class, subject to certain conditions regarding the magnitude of R(Qp, Qq), then the problem reduces to finding

min max [S(Qi)] (12)

a i=1,...,m

when

N ( )

2 Cj a j < Co и R(Q p, Q q) > Ro(pq). (13)

j=1

If the required system performance can be achieved through the "bias" objects that belong to different classes under certain conditions with respect to the value s(Qi ) , i = 1,... , m, then the problem reduces to finding

when

max min [R(Qp, Qq)] (14)

a p,q=1,...,m

N

2 Cja j < Co и S(Qi) < S0. j=1

(15)

If a proper system performance can be achieved only by increasing the ratio of distances between the classes to the rms scatter within the classes of objects, then the problem reduces to finding

max min

a p,q=1,...,m

R2(Qp Qq)

S(Q p)S(Q q)

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

(16)

when

N

2

j=1

Cjaj < C0 .

(17)

III. The selection problem solution of informative FEATURES THAT CHARACTERIZE THE STATE OF THE RES LC PROCESSES

The problem considered above is a generalization of a nonlinear programming problem. optimality conditions for it can be formulated as follows: in order the vector C0 to be an optimal strategy, it is necessary that there exist a scalar P > 0 and vector p = {pb..., pn } which will be equal

2b r p r

r=1

dPj(C0)

dCj

= P, j = 1,

Np

Np

2 c0 = Co j=1

Np

2pr = 1, br = 0, если 2pjPj(C0) > W(C0)

r=1

j=1

(18)

An introduction to the consideration of scalar P and vector p increases the number of unknowns C0, pr and p to a value Np + n +1. However, the number of equations equals the number of unknowns, since for every r or pr = 0 , or

Np

2prPj(C0j) = W(C0). (19)

j=1

Thus, the solution of the system (18) makes it possible to determine the composition of the working vocabulary signs and optimal allocation of costs to provide tools for observing the recognition system under the assumption of depend-

18

R&I, 2012, №1

ence of Pj = Pj (Cj) and limitations to the total cost of these funds.

With the limitations associated with the ability to use a dictionary of all attributes, the problem arises of choosing a limited list (up to 2-3 characters). Here it is possible navigate in the location of the individual components of feature vector with respect to the boundaries of performance facilities monitoring.

Since for the boundary value of the parameter y j,, the

end of the vector X must be located on the boundary of operability, it is necessary to satisfy the equality

xiг, = a‘yjip . (20)

A correlation coefficient rij between the parameters may be an additional criterion for selection under statistical estimation. Since the maximum correlation coefficient provides a maximum amount of information

J(yj) = H(yi) - H(yX), (21)

/y

contained in the parameter yi . Here H(yi ) is the initial entropy; H(y/ ) is the conditional entropy of the object

/y1

after the measurement of parameter yj.

The use of a binary correlation algorithm allows for the participation of decision makers (DM) to formalize and automate the processes of input, processing, and recognition of an image.

IV. RES LIFE CYCLE PROCESSES STATE IDENTIFICATION

The solution of the RES life cycle identification, involves creation the rules that define the RES state.

The signs, that allow to distinguish the state of an object under monitoring, are efficiency indicators, which will have a given value or an extreme value for the selected state. To identify the RES state from the observed parameters in the monitoring process it is necessary to select a set of parameters, in which the value of performance indicators

will have given or extreme values.

The objects of observation - the parameters and characteristics of RES - can be considered as points of vector and functional spaces. For all pairs of points in the set Q, there is a binary relation of the comparative effectiveness: point х is more efficient than point y then and only then when (x,y) є Ф or in another writing х є y. The problem of allocating the core is solved by ensuring the RES life cycle - a set of maximal elements of the variables Х to a binary relation

Ф :X* = Max (Q, Ф) .It is assumed that the solution of the problem exists i.e. set Х* is not empty. In many problems, we may assume that the solution is a set Х* which consists of one element, and the relationship between the elements is established by means of functionals Л(х). For instance, the

point х is more effective than point у, when Л(х) < Л(у) or Л(х) > Л(у). It can be shown that in problems of determining the effective pixels х о є X* with constraints х є Qi,the functional f = кЛ'(х о), where Frechet derivative Л'(хо) at the point X0 is support functional to Qj, at the point X0 (i.e. (f,хо) < (f,х) for allх є Qj.

Thus, the problem of analyzing the results of observations in the monitoring process is reduced to the determination of support functionals at points of observation, which makes it possible to estimate the deviation of the observed points from effective points.

In terms of functional analysis [4,5]: let Q be some set in linear topological space E , E' be the dual space, хо є Q is an extreme point of Q, Kb is the cone of possible directions in Q at the point x0, Kk is the cone of tangent directions for Q in x0 . If the set of linear functionals, which are reference to the Q at the point x0, denotes the Q *, then Q*={f є Б'^(х) > f^0) for all х є Q, i.e. support functional and the extreme point хо є Q provide an opportunity to allocate a set Q . It can be shown that if Q is closed convex set, then Q* = Kk * , i.e. forms a cone formed by the set of linear functionals reference to Q in x0. The cone of

tangent directions can be determined from the Frechet derivatives of the operators (convex function), which connect the sets of parameters and performance indicators.

Let us consider the methods of finding K * for ways to specify K using different functionals.

Example 1. In the case of determination Q using affine sets: E=Ej xE2; Ej, E2 are the linear topological space, a performance characteristics set is defined E2, D is a linear operator from Ej to E2, K = {х є E,

х = (хьх2) = х2}, K* = {f є E',f = (fj,f2):

fj =-D*f2}, and as a reference separating function one

can use the expression

ґ(х) = (-D*f2,хх) + (f2,х2) = -(f2,D* хх - х2) .

Application of this function to separate the sets in the parameter space and the formulation of rules, that establish a correspondence between the sets of parameters and values of performance indicators, can provide the identification of states in the process of monitoring RES LC.

Example 2. A functional Л(х) in the linear space E has

a derivative Л'(х o,g) at the point x0 in the direction of g, there is

lim Л(х 0 + eh) - Л(х 0) s —— +0 є

f<x 0,g)

(22)

The functionals are correctly decreasing at any point and allow us to find the cone of decreasing directions. The func-

R&I, 2012, №1

19

tional Л(х), which is a set in a Banach space E , is differentiable (or Frechet differentiable) at X0, if there exists a linear functional f є E' such that for all g є E

Л(х0 + g) = л(х0) + (f,g) + o(|g|) ■ (23)

If Л(х) is differentiable at the point x0, then F(x) is correctly decreases at the point X0 and K = ^:(Л'(х о), g) < 0}. К is decreasing direction cone of the functional Л(х) at the point х 0, Л (х) (satisfies a Lipschitz condition in a neighborhood of х 0 є E, Л(х) is differentiable at X0 in any direction, and f(x0,g) as a function of g is convex if g є K and Л'(х0,g) g є E, E is a Banach space, Л(х) satisfies a Lipschitz condition in the neighborhood of X0 (for some є 0 > 0

|Л(хО - Л(х 2) <Р||хі - х 2| for all ||х і - х „|| <є 0,

||х2 - х0| < є0)and )Л'(х0,g) < 0 will be satisfied, then Л( X) is correctly decreasing at X0, and

K = {g: Л'(х0,g) < 0}.

Example 3. In the case of the set, which is not defined by a functional. If Q is a convex set, then the decreasing direction set Kb at X0 takes the form

Kb = MQ° - х0),Z> 0}

(i e. Kb = {g : g = к(х -х0), х є Q0,к> 0}).

Example 4. Р(х) is an operator from Ei in E2, that is differentiable in a neighborhood of x0, Р(х 0) = 0. Р'(х) is continuous in the neighborhood of x0, and Р'(х 0) displays Ei for all E2 (i.e. linear equation Р'(х 0)g = b has a solution g for every b є E 2 ), then the set of tangent directions K to set Q = {х : Р(х) = 0} at x0 is the subspace K = {g : Р'(х 0)g = 0}.

When the Р'(х0)Ei Ф E2, one can only state that K c {g : Р'(х0)g = 0}.

Example 5. Let х є Rm , Q = {х : Gi (х) = 0 , i = 1, ...,n}, where Gi(x) are functions continuously differentiable in a neighborhood of x0, G^0) = 0, i = 1,...,n , and vectors Gi'^ 0), are linearly independent. Then

K = {g єRn : ^Дх0),g) = 0,i = 1,...,n}.

Here E1 = Rm, E2 = Rn, Р(х) = (С1(х), ..., G^)), Р'(х0) is matrix mxn, i column is equal to Gi'^0) .

Example 6. In the process of monitoring it is necessary to determine whether the effective value of the function- RES characteristics w(z) is provided and if in the simplest case the extreme value of an differentiable objective function is provided for one variable, for which it is necessary to check whether the derivative is zero at the observed value of the

parameter. For multi-dimensional objective functions and their arguments, this problem may be conceded as part of set theory and functional analysis.

Formalizing in the observation problem of optimal setting, as one of the RES life cycle processes, lies in the fact that it is necessary to estimate optimal function of the process setting v(z) є M , where z is the parameter determining the numerical value of the required characteristic w (z) of a setting object to provide such a phase trajectory, which ensures the equality w(0) = c, w(Z) = d and extreme values Z

of the integral functional J Ф^^), v(z),z)dz, in the case 0

of a connection, given by the differential equation

dwd(z) =9(w(z),v(z),z) .

dz

In problems that requires maximum compliance of an optimised characteristic to some desired characteristic, minimum mean square deviation criterion is used

t W2(X) = (Y(X) - Y*)2, (24)

where Y* is the value characteristic desired or required by the technical project.

For the characteristic, which is given by a discrete set of points, the objective function

1 N * 2

W2(X) = - 2Yi(Y(X,pi) - Yi)2, (25)

N i=1

where N is the number of sampling points of the independent variable p; Y(X,pi) is the value of the optimized performance in the i-th point of the sampling interval; yi - optimized characteristic weight coefficient values that reflects the importance of the i-th point compared with other points (usually, 0 < yi > 1).

In some optimization problems it is necessary to ensure the excess or not the excess of some given level optimized characteristics. These criteria of optimality are implemented by the following functions:

- to provide excess of a given level

W3(X) = j 0 2 at ™^Н, (26)

I (Y- Y(X))2 at Y(X) < Yh;

- to provide unexcess of a specified level

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

W4(X)

0 ,2at Y<X'< Y* (27)

(Y(X)- Yb)2 at Y(X) > Yb,

where YH*,YB* are the feasible region lower and upper bounds for the characterization of Y(X).

If it is necessary that an optimized characteristic be within some permissible zone (boundary), a combination of two previous optimization criteria are used

W (X)

0 at YjH < Y(X ) < yB,

(Y (X ) - Yb)2 at Y(X) > Yb (Yh - Y(X))2 at Y(X) < Yh

(28)

20

R&I, 2012, №1

For those cases when it is necessary to implement only the shape of the curve while ignoring a constant vertical bias, there the shift criterion is used.

W6(X) = Zyi(Yi -Y(X,p;)-YCp)2, (29)

i=1

1 N *

where YCp = -X(Yi -Y(X,pi)).

N i=1

The form of objective function effects the computing process important characteristics and, convergence of the optimization process. Derivatives signs of the objective function for the controlled parameters are not constant throughout the feasible area, this circumstance leads to the ravine of the character (for example, problems of circuit design), which leads to high computing cost and requires special attention to the choice of optimization method.

Another feature of the objective functions is that they usually multiextremal and along with the global minimum have local minima.

A general class of problems identifying the set of efficient solutions consists of multi-criteria optimization problems. They are characterized by the fact, that binary relation on the set of alternatives is associated with a set of indicators, forming an efficiency vector criterion. This binary relation is generated by a variety of ways. Thus, if the

W(x) = (W1(x),...,Wm(x)) (30)

is vector criterion on the set of X, than binary relation can be Pareto ratio, or ratio of Slater. In other cases, a binary relation on X is defined by a system of preferences of the decision maker (DM). It is assumed that the main source of information is a person who has information sufficient to make a (unique) solution. Identification of the system of DM preferences represents one of the main problems in solving multiobjective problems. Usually a procedure for identifying the decision maker preferences are based on the language of vector evaluations of alternatives.

Algorithms based on scalarization - reducing to a parametric family of scalar optimization problems - are most illustrative.

IV. Conclusion

The scientific result of the paper is as follows: methods of informative features selection were developed that solve the problems for monitoring the RES life cycle, by classifying the states of RES and the life cycle processes in the feature space, each of them having a particular significance. This has allowed to find comprehensive criteria and formalize the selection process. Heuristic methods for selecting the criteria for the use of prototypes and basic information priorities are proposed, when there is an insufficient number of a priori data for correct classification.

The methods of solving the problem of choosing informative features of RES LC processes were researched that will allow a meaningful description of the processes for implementing RES LC decision-making procedures in the man-machine systems.

References

[1] Podinovskii V.V., Nogin V.D. Pareto-optimal solutions of the multicriteria problems. М.: Fizmatlit, 2007. 255p., (in Russian).

[2] Vishnekov A.V., Ivanova E.M., Safonov I.E. Methods of decision making in distributed CAD / CAM / CAE systems. Methods for complex estimation of options in terms of certainty of the initial information // Quality and CALS-technologies. 2006. № 3(11). P. 35-40 (in Russian).

[3] Kozhevnikov AM. Methods of CALS-technologies to optimize the choice of electrical and thermal elektroradio products regimes // ITPP. 2000. № 3. P.23-26, (in Russian).

[4] Biryukov S. I. Optimization. Some theoretical aspects. М.: MZ-Press, 2003. 246 p. (in Russian).

[5] Vasilyev V.I. Recognition systems. Reference book - Kiev.: Naukova Dumka, 1983. 422 p., (in Russian).

Nevlyudov Igor Shakirovich, Doctor of Technical Sciences, professor, head of TPRA Department of Kharkiv National University of Radio Electronics. Research interests: instrumentation technology, flexible manufacturing systems, robotics. Address: 14 Lenin Avenue, Kharkov 61166, Ukraine, tel. (057) 702-14-86. E-mail: tapr@khture.kharkov.ua Andrusevich Anatoly Alexandrovich, Director of Aviation Krivoy Rog College. Research interests: instrumentation technology, process monitoring, production of electronic systems, improving the reliability of electronic systems. Address: 1 Tupolev St., Krivoy Rog 50045, Ukraine, tel. (0564) 27-56-79. E-mail: uchebotdel@kk.nau.edu.ua

Khatnyuk Igor Stanislavovich, Postgraduate student of TPRA Department, Kharkov National University of Radio Electronics. Research interests: instrumentation technology, designing of flexible PCBs, improving the reliability of electronic systems functioning. Address: 14 Lenin Avenue, Kharkov 61166, Ukraine, tel. (057) 702-14-86. E-mail:

tapr@khture.kharkov.ua

R&I, 2012, №1

21

i Надоели баннеры? Вы всегда можете отключить рекламу.