Научная статья на тему 'Identification of multidimensional technological processes with dependent input variables'

Identification of multidimensional technological processes with dependent input variables Текст научной статьи по специальности «Математика»

CC BY
45
11
i Надоели баннеры? Вы всегда можете отключить рекламу.
Ключевые слова
A-PRIORI INFORMATION / IDENTIFICATION / NONPARAMETRIC MODEL / NONPARAMETRIC ALGORITHMS / H-MODEL / FRACTIONAL DIMENSION SPACE / АПРИОРНАЯ ИНФОРМАЦИЯ / ИДЕНТИФИКАЦИЯ / НЕПАРАМЕТРИЧЕСКАЯ МОДЕЛЬ / НЕПАРАМЕТРИЧЕСКИЕ АЛГОРИТМЫ / H-МОДЕЛЬ / ПРОСТРАНСТВО ДРОБНОЙ РАЗМЕРНОСТИ

Аннотация научной статьи по математике, автор научной работы — Medvedev Alexander V., Mihov Eugene D., Ivanov Nikolay D.

In the article the problem of identification of multidimensional technological processes is studied. The question of modelling H-processes with several output variables is considered. Amethodof analysis of the training sample that determines thepossibility of modelling an H-process with several output variables is proposed.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Идентификация многомерных технологических процессов с зависимыми входными переменными

В статье рассматривается задача идентификации многомерных технологических процессов. Изучен вопрос моделирования H-процессов с несколькими выходными переменными. Предложен метод анализа обучающей выборки, при помощи которого определяется, возможно ли создать модель исследуемого Н-процесса с несколькими выходными переменными.

Текст научной работы на тему «Identification of multidimensional technological processes with dependent input variables»

УДК 519.87

Identification of Multidimensional Technological Processes with Dependent Input Variables

Alexander V. Medvedev*

Siberian State University of science and technologies Reshetnev Krasnoyarsky Rabochy, 31, Krasnoyarsk, 660037

Russia

Eugene D. Mihov^ Nikolay D. Ivanov*

Siberian Federal University Svobodny, 79, Krasnoyarsk, 660041

Russia

Received 26.05.2017, received in revised form 13.06.2018, accepted 26.07.2018 In the article the problem of identification of multidimensional technological processes is studied. The question of modelling H-processes with several output variables is considered. A method of analysis of the training sample that determines the possibility of modelling an H-process with several output variables is proposed.

Keywords: apriori information, identification, nonparametric model, nonparametric algorithms, H-

model, fractional dimension space.

DOI: 10.17516/1997-1397-2018-11-5-649-658.

Introduction

Identification of stochastic objects is often reduced to identification of static systems with delay. This is due to the fact that some output variables of the object are controlled by much larger intervals than the input variables. For example, some variables are measured electronically (in this case, discrete control can be sufficiently small), while the other variables are controlled by chemical analysis or physical-mechanical tests (in this case the discrete control AT is large, i.e. AT » At).

The most common scheme a discrete-continuous process under study can be represented as follows on Fig. 1.

The following notation is used on Fig. 1: A is the investigated object (the process); z(t), z(t), and z(1t) are the output vectors of the process; u(t) is the vector of control actions; -it is an uncontrolled but measured input vector of the process; U(t) is the input vector of uncontrolled and not measured process variables; Ztt is the casual influence; U(t) : i = 1, k are the process variables controlled in the object; t is continuous time H^, Hu, Hx, Hz, Hq, Hu are the communication channels corresponding to various variables, including control devices, devices for measurement of observed variables; ¡jt ,ut,xt,wt are measurement of -(t, z-t), z-t), u-t) in discrete time; h^(t),hu(t),hx(t); hUl (t) ...hUk (t) are measurement errors of the corresponding variables of the process.

* Saor_medvedev@sibsau.ru 1 edmihov@mail.ru

* blackhawk009@ yandex.ru

© Siberian Federal University. All rights reserved

Fig. 1. The general scheme of a process

It should be noted that processes related to cybernetics and the automatic control theory, rather than processes from probability theory (like Wiener or Gauss processes) [1] are considered here.

1. Levels of a priori information

There are systems with different levels of a priori information [2]:

• systems with parametric uncertainty. The parametric level of a priori information assumes presence of a parametric structure of the model and some characteristics of random interference such as zero mathematical expectation and bounded variance. Iterative probabilistic procedures are commonly used to evaluate parameters.

• systems with nonparametric uncertainty. This level does not assume presence of a model, however, some knowledge about qualitative characteristics of the process such as ambiguity or unambiguity of its characteristics, linearity for dynamic processes or the nature of their nonlinearity is required. Methods of nonparametric statistics are used to solve problems of identification.

• systems with parametric and nonparametric uncertainty. The task of identifying multiply connected systems, when the amount of initial information does not correspond to the types described above, is important in practice. The problem of identification is formulated under the conditions of both parametric and nonparametric a priori information. Models represent an interconnected system of parametric and nonparametric relations in this case.

2. General description of ^-processes

The situation when the input actions are stochastically dependent often appears in the modelling and control of inertia-free objects with delay. The nature of dependence between the input variables is often unknown. The described situation is typical for discrete-continuous processes dominating in metallurgy, coal industry, construction industry, etc.

The case when the input variables have a stochastic dependence is the special case, which may be classified as a new class of identification tasks. Such processes are called HA-processes [3]. In other words, an Hx is a process with input variables stochastically dependent.

The space of input and output variables of the process is the domain of the process. When input variables have stochastic dependence, the process is localized in a subdomain. Any process with proportional input variables is an H-process: steel smelting (dependent variables are carbon and iron), dough kneading (dependent variables are flour and water).

In three-dimensional space an HA-process may look like a tube, a line or a part of a plane. In the simplest case an HA-process is a line (Fig. 2).

Fig. 2. Example of an ffA-process. The case of linear dependence between input variables

In this example the relation between u2 and u1 is linear; noise does not affect this process. In real production noise is always present, but the linear dependence between input variables may exist. Of course, complex processes may have a non-linear dependence between input variables (Fig. 3).

Fig. 3. Example of an ffA-process. The case of non-linear dependence between input variables

Such processes are more complicated than the processes depicted in Fig. 2. The researcher must know the non-linear dependence between variables for successful modelling of the process. Similar processes are more often encountered in industry than those considered earlier.

t

Nethertheless, this process is represented by a curve. This means that if the input variables are given even with a slight deviation from the dependency, the process does not exist.

An HA-process close to a real process is shown in Fig. 4. The faceted structure indicates the domain of existence of an H-process.

Fig. 4. Example of an ff -process close to a real process

3. Indicator function

If we try to recover a parametric model of an HA-process without taking into account its tubular form, then only points of Q^(u, x) will be used. In particular, even in the simplest case there are infinitely many models of a line in the for of a plane (see Fig. 5).

Fig. 5. Parametric models of an ff-process

In the case of a non-linear dependence a recovered surface may also be not unique. To rectify the situation we consider the indicator function. Using the indicator function we transform the standard model

Xs = f(u,as), (1)

into

Xs = f (u,as)I (u). (2)

The indicator function I(u) is defined as follows

i;

l, u G (u),

I (u) = { A (3)

v ; V u / QH(u). v '

Note that the domain ilH(u) is not known, only the sample = 1,s is given. The

estimate xs can not be calculated if the indicator function is zero, i.e. for such values of the vector components u e Q(u) the process does not exist. If the indicator function I(u) is equal to one for any value, u e Q(u), then the model converts to the standard one. As an estimate of the indicator function Is(u) we can take

s k

Is(u)= sgn^n $c-1(uj - uj)), (4)

i=1j=1

where $ is a core function.

A core function $( * ) is a function satisfying the following conditions

1 /y u^l) du =it (5)

cW y cs J

— <X>

lim — [ v(u)®(u ~ ui )dt = y(ui). (6)

Cs J\ Cs s

The triangular core

1^0, | . | > l

is used for the indicator function estimate.

#A-processes with several output variables

HA-processes described above have one output variable. Certainly, this is unrealistic, as a rule real technological processes are characterized by sets of output variables (xj ,j = 1,m, where m is the number of output variables. It is logical to assume that some of the output variables vary not in Qjx(u) but in its sub domain QH\(u).

An example of such process with two input and two output variables is shown in Fig. 6.

Existence of such processes implies appearance of qualitatively new tasks in the field of modeling and control.

#A-processes with intersecting outputs

Consider a special case of Hx processes with several output variables. Assume that it has two input and two output variables; the training sample has size n: (x1i,x2i,uii),i = l,n. The structure of the process is shown in Fig. 7.

Here we use the notation: (u) is the domain of input variables; Q^1^) is the domain of input variables for the output variable x1; Qf2 (u) is the domain of input variables for the output variable x2; Qf12(u) is the intersection of Qf1(u) and QH2(u); u1,u2 are the input variables; x1; x2 are the output variables.

The output variable x1 is defined only in Qf1(u). Outside this domain a subprocess giving x1 does not exists. The same holds for the output variable x2 and the domain Qf2(u). It follows that a process with two output variables x1 and x2 exists only in Qf12(u).

Fig. 6. Example ffA-processes with several output variables

Fig. 7. A process with two outputs having common domain

A researcher does not know about existence of QH1(u), QH2(u), and Q^12^). This can adversely affect the accuracy of the design model (it may forecast even in the case when the process does not exist, which means that the forecasts are incorrect). A logical solution in this case is to use two indicators for each output variable.

With indicators the model looks as follows:

xl = Ii(ui)l2(uH.)fi(ui,as), x2 = Ii(ui)l2{ui)f2{ui,as),

(8)

where Ij (u) is the indicator function that determines if the the vector of input variables u belongs to the domain Q^'(u), x1 and x2 are the output variables of the process.

The indicator function s have the form

Ii(u) = <

I2(u) = <

1, u e üHi(u),

ue ^Hl( u),

ri, u e üH2(u),

o, u e nH2(u).

In general, the model of an ff-process with k output variables looks as follows:

k

xi = fi(u, as) JJ Iv(u),

v=0

..................................... (10)

k

Xk = f k(ui, as) JJ Iv(ui).

v = 0

If one of the output variables x is defined for any value of the input variables from Q(u), then the indicator function is identically 1 and does not affect the expression k=0 Iv (u).

If a process is not an ff-process, then all indicator functions are identically 1, and

becomes

xi = f i(ui ,as),

nk=0 Iv (u) = 1. In this case the model becomes

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

.................................. (11)

xk = fk(ui, as),

which proves that model (6) is a more general form of a process model with several output variables.

Let us consider an example of a process model with indicator functions. Take a simple system of equations:

xi = ui + u-2, 1 2 (12) x2 = u + 0.6u,2,

where u e (0,3); the noise that affects the process is 5%. This is an Hx process. For the first output variable we have the following model:

u2 =sin(u1/1.3) * 2 + £1, ^ e (-0.2; 0.2). (13)

For the second output variable we have

u2 = 3 - u1 + £2, £2 e (-0.2; 0.2). (14)

The training sample is given in Fig. 8.

Fig. 8 demonstrates that some part of the training sample does not belong to ^H12. We process the elements of the training sample by multiplying them by the indicator functions.

Fig. 9 demonstrates that only those elements that belong to QH12 remain after applying the indicator functions. To find the intersection of QH1 and QH2 a sliding test can be employed. The rule for determining whether an element of the training sample belongs to the intersection is:

M(ui) = H Iv(u),i = 1,n.

v=0

The function M(ui) can assume one of the following values:

i 1 u f= QH12

M (u) = \l:ut e nHU (15)

This analysis allows to judge about characteristics of this intersection, as well as whether it is empty or not.

Fig. 8. The training sample before processing

Fig. 9. The training sample after processing

HA-processes without intersecting outputs

If M(ui) =0, i = 1,n, the output variables of the process under investigation do not have a common domain. This case is depicted on Fig. 10.

This means that the process cannot exist with such set of output variables, for example if they exclude each other. Such a situation is typical for active processes (processes involving people), organization processes, rarely for technological processes. In this case an expert has to change the set of output variables so that they have non empty intersection domain. It should be noted that this clearly implies an action of an expert.

The following algorithm can be used to find sets of output variables with intersection to present them to an expert: One checks the values of M(u, x) of the process using only two output variables xi and xj, i = 1,k — 1, j = i,k, where k is the number of output variables. The pair of output variables for which M (u, x) = 1 should be marked as having non empty intersection; thus, we find xp,p = 1, q, where q is the number of found pairs. To each found pair we add one of the remaining variables and proceed as above to find appropriate triples of output variables. This operation is repeated until addition of a new variable leads to M(u, x) = 0.

Fig. 10. An ff-process without intersecting outputs

The expert should choose one set of output variables from the obtained ones.

The proposed algorithm is a 'greedy' one, but it does not miss the global minimum in this problem. The condition M(u,Xi) = 0, M(u,Xj) = 1 must be met to skip the global minimum, where Xj is a vector of output variables that includes all output variables from Xi and one more output variable. This situation is excluded, because if M(ui) = nV=0 (u) = 0, M(Uj) will also be equal to 0.

The described case is a new one for identification tasks. It should be added also that a lot of active processes fall into the situation shown above, therefore the study of this particular situation is very relevant.

Conclusion

The problem of modelling ff-processes was considered. A method of modelling ff-processes based on the modification of the parametric model using the indicator function is proposed.

A new kind of ff-processes, namely multidimensional ff-processes, is considered. The case when the intersection of domains of variables is not empty is described, and a method of modelling such processes is proposed.

Also the case when the intersection of domains of variables is empty is considered. A method for selecting a new set of output parameters that has non empty intersection is proposed.

The research was supported by a grant from the Russian Science Foundation (project no. 1619-10089).

References

[1] J.L. Doob, Stochastic Processes, Wiley Classics Library, 1990.

[2] N.G.Zagoruiko, Cognitive data analysis, Geo. Novosibirsk, 2013 (in Russian).

[3] A.V.Medvedev, Analysis of the data in the problem identification, Computer analysis of modelling data. Minsk, BSU, 2(1995), 201-206 (in Russian).

Alexander V. Medvedev, Eugene Б. ЫШСУ, Nikolay Б. Ivanov Identiflcation Multidimensional...

Идентификация многомерных технологических процессов с зависимыми входными переменными

Александр В. Медведев

Сибирский государственный университет науки и технологий им. М. Ф. Решетнева

Красноярский рабочий, 31, Красноярск, 660037

Россия

Евгений Д. Михов Николай Д. Иванов

Сибирский федеральный университет Киренского, 26, Красноярск, 660041

Россия

В статье 'рассматривается задача идентификации многомерных технологических процессов. Изучен вопрос моделирования Н-процессов с несколькими выходными переменными. Предложен метод анализа обучающей выборки, при помощи которого определяется, возможно ли создать модель исследуемого Н-процесса с несколькими выходными переменными.

Ключевые слова: априорная информация, идентификация, непараметрическая модель, непараметрические алгоритмы, Н-модель, пространство дробной размерности.

i Надоели баннеры? Вы всегда можете отключить рекламу.