Научная статья на тему 'Time Series Analysis by Soft Computing Methods'

Time Series Analysis by Soft Computing Methods Текст научной статьи по специальности «Математика»

CC BY
95
41
i Надоели баннеры? Вы всегда можете отключить рекламу.
Ключевые слова
Fuzzy Time Series / Fuzzy Tendency / Elementary Tendency / Summarization
i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Qualitative evaluation and comparison of changes of indications of objects having different nature is used by designers, managers, people making decisions (PMD) and experts to make the decisions more reasonable. For suport of such activity on the analysis of changes of data connected with certain dates and time intervals, models of fuzzy time series are applied. In this article a model of fuzzy tendency the carrier of which is a fuzzy time series and its variety — elementary tendency model — is offered. The offered models are aplied for solution of the problem of summarization of fuzzy time series in terms of tendencies.

Текст научной работы на тему «Time Series Analysis by Soft Computing Methods»

Time Series Analysis by Soft Computing Methods

N. Yarushkina Information Systems Department Ulyanovsk State Technical University, UlSTU Ulyanovsk, Russia jng@ulstu.ru

Abstract—Qualitative evaluation and comparison of changes of indications of objects having different nature is used by designers, managers, people making decisions (PMD) and experts to make the decisions more reasonable. For suport of such activity on the analysis of changes of data connected with certain dates and time intervals, models of fuzzy time series are applied. In this article a model of fuzzy tendency the carrier of which is a fuzzy time series and its variety — elementary tendency model — is offered. The offered models are aplied for solution of the problem of summarization of fuzzy time series in terms of tendencies.

Keywords Fuzzy Time Series, Fuzzy Tendency, Elementary Tendency, Summarization.

I. Introduction

In connection with increasing volume and speed of storage of data connected with certain dates and time intervals in data bases, the new technology of data analysis Data Mining for Time Series Data Base (TSDM) is actively developed. Forming these time sequences on the basis of domains permits to consider them in the form of time series (TS). A number of distinctions of such time sequences from traditional (classic) TS, considered in statistic theory of analysis and forecast of TS, can be noted: these time sequences are short TS; such time sequences can be represented not only by numerical but also by linguistic values; it is difficult or impossible to determine suppositions about random nature of values for such time sequences. Distinctive properties of Time Series Data Base generate changes in the aggregate of aims and tasks which form the direction Data Mining for Time Series Data Base[1]: segmentation, clusterisation, classification, indexing, summarization, disclosing anomalies, frequency analysis, forecasting, extraction of associative rules. At present the methodology of solving the indicated problems is formed on the basis of methods and models of intelligent analysis, representation and processing of data [2, 3, 4, 5]. The basis of the offered methods of intelligent data analysis is the concept of the fuzzy time series (FTS) constructed on the base of levels of TS [5, 6] or their first differences [3]. In this article an original model of FTS on the basis of fuzzy tendencies is offered. The offered model describes the behaviour of FTS in the form of the sequence of fuzzy tendencies represented by information granules. Application of information granules for problems of Data Mining for Time Series Data Base is

presented in the work [7]. An indisputable advantage of data granulation is the possibility of representation of models of TS at different levels of abstraction in the linguistic form, which permits to widen the accessibility of using models of TS by application users and to improve their interpretability.

In the second part of the article models of a fuzzy time series and a fuzzy tendency are considered. In the third part a special scale for generating an elementary fuzzy tendency is given. Procedures of granulation of FTS in the basis of FT are presented in the fourth part, there is also described the solution of the problem of summarization of TS in terms of FT and presented the experiment on the solution of the problem of summarization of a synthetic TS as the problem of determination of a fuzzy tendency.

II. Fuzzy tendency model

A. Concept of linguistic evaluation

One of the problems of the analysis of TS is the analysis of FTS behaviour, that is change of values of TS levels. The solution of the problem of the analysis of TS behaviour expressed in linguistic form can serve as the linguistic evaluation of the behaviour. Linguistic evaluations (LE) are the means of qualitative evaluation and comparison of characteristics and indications of objects having different nature used by designers, managers, people making decisions (PMD), experts. An important property of linguistic evaluations is wide application in practice in making decisions for expressing knowledge about the degree of correspondence of the object being observed or its characteristics to some objective or subjective criterion. The stated property determines the class of absolute LE, which reflects the static aspect of evaluation. The following expressions can be examples of such evaluations: "Satisfactory", "Good", "Bad", "Big", "Small", "Medium", etc. The semantics of absolute linguistic evaluations depends on the context of the environment in which they are used and modeled by fuzzy sets.

Another important property of linguistic evaluations is conditioned on the possibility of ranking them, it permits to present the aggregate of LE in the form of some system with relations. Binary relations formed on the set of absolute LE generate comparative linguistic evaluations by different criteria such as "More", "Less", "Approximately Equal", "Earlier",

"Later", "Rather", "Better", etc. Comparative evaluations made on the basis of absolute LE can represent changes in different universes: in the universe of objects, in the time universe, in the universe of problems and they express dynamic aspect of evaluation. The semantics of comparative evaluations is also context-dependent and can be modeled on the basis of fuzzy sets.

It is noted in the article [4] that linguistic evaluation has indications expressing the degree of intensity of this evaluation. These indications can be represented in the linguistic form, usually used by people: "Very", "Insignificantly",

"Approximately", etc.

Context-dependent linguistic evaluations considered above are given in expert way as a rule, and they are called expert evaluations. In case of impossibility of receiving expert evaluations of indications of objects, abstract linguistic evaluations are used, let us consider such evaluations among the class of context-free linguistic evaluations.

Let the aggregate of all linguistic evaluations forms the

finite set X ={ Xj }, where jE[1,nj]. Let us call the linguistic

evaluation of the indication x from the universe B a fuzzy

label, if the fuzzy set is determined for it such that Xj={< wm

, p ~ (wm) >1 wm E w, w ^ B }, where wm - is the carrier of the

X j

fuzzy set, p ~ (wm) - is the membership function.

Xj

B. Model and kinds of a fuzzy tendencys Let us introduce definitions.

Let some time series (TS) Xt={ti,xi} is given, iE [1,n], n - is the quantity of members of the series, xi E B, ti E Bt.

Let us call the ordered sequence of observations over some phenomenon the states of which change in time if the value of the state at the instant ti is expressed with the help of the

fuzzy label Xt E X , i E [1,n] , n - is the quantity of

members of series, a fuzzy time series (FTS). That is we

represent a fuzzy time series in the form Xt ={4 X ,}, where

X i - i-th fuzzy set (fuzzy label), U - i-th value of the instant of time, t1 < tj < tn, n - is the quantity of members of FTS. Any TS can be represented in the form of the sequence of fuzzy labels X. ={< wm, p ~ (wm) >1 wm E w, w ^B } on the basis

Xj

of linguistic (context-dependent or context-free) evaluation of levels of TS xi E B.

Let us call the fuzzy label Tk e X expressing the character of

change (systematic motion) of the sequence of fuzzy values of

FTS Xt in the given interval of time the fuzzy tendency (FT)

of a fuzzy time series. A fuzzy tendency determines the nature of FTS not in analytic, but in the linguistic form.

Each fuzzy tendency Tk of the fuzzy time series X can be

represented by the fuzzy set Tk ={<Ti,/u~ (xi) >, ie [1,n]}

with the function of membership in the fuzzy time series /u~ (xi), where xi is the model of the following form:

xi = <vi,ai, At i >, where

vi - is the type of the tendency. Let us compare fuzzy labels Tk and base types of tendencies "Increase", "Decrease",

"Stability". On the basis of base types derivative types of tendencies, such as "Fluctuations", "Chaos", "Load", "Idle time", etc., can be formed. ai- intensity of the tendency.

Ati - duration of the tendency.

In case if Ati =1, let us consider the fuzzy tendency among the class of elementary fuzzy tendencies, if Ati = n-1, we consider the fuzzy tendency among the class of general fuzzy tendency of FTS, if 1<Ati< n-1, let us consider the fuzzy tendency among the class of local (derivative) fuzzy tendencies of FTS.

In the aspect of content an elementary fuzzy tendency models changes between two neighbouring values of the fuzzy time series Xi, Xi+1, and it can be compared with the instant of time ti of FTS. A local fuzzy tendency is determined between two chosen values of the fuzzy time series Xi and

Xj, when i< j can be compared with the instant of time ti. Any

local FT can be expressed by the sequence of elementary ET. The general fuzzy tendency characterizes the behaviour of all FTS and it is representable in the form of the sequence of local and, therefore, also elementary FT. Thus, the time series of elementary FT of the form { ti, Tk ( ti, X ,)}, i e [1,n-1] can be

made for any FTS.

The analysis of fuzzy labels used when evaluating levels and behaviour of a time series permits to make the following conclusions:

1. Models of elementary, local, general FT of a fuzzy time series have common structure.

2. Local and general fuzzy tendencies of FTS can be expressed through the time series of elementary FT.

3. A time series of elementary FT is an invariant method of linguistic representation of behaviour of any FTS.

4. Representation of time series in the form of fuzzy time series and time series of elementary fuzzy tendencies permits to take into account additional knowledge in the form of semantics of application area during their analysis owing to use of context-dependent fuzzy labels.

In the next part we will introduce a special scale for generation of the model of the elementary fuzzy tendency.

III. ACL-SCALE FOR GENERATION OF THE MODEL OF THE ELEMENTARY TENDENCY

In this part a special linguistic scale is offered as a tool of both absolute and comparative linguistic evaluation - ACL-scale (Absolute & Comparative Linguistic). This scale will be

applied for construction of the model of the elementary fuzzy tendency.

The model of ACL-scale Sx for determination of absolute and comparative linguistic evaluations is representable in the form of linguistic variable with relations

Sx = <Name_ Sx, X, B, G, P, TTend, RTend>,

where Name_ Sx - is the name of ACL-scale, X - is the base term-set of absolute LE (linguistic name of gradations), for

example, X ={Bad, Satisfactory, Good, Excellent, ..},

Xt є X , B - is the universal set on which the scale is

determined, xeB . G - are syntactic rules of deduction (generation) of chains of evaluation propositions (derivative of terms which do not enter into the base term-set), P - are semantic rules which determine membership functions for each term (they are usually given in an expert way),

TTend(Xt, Xj) - is the linguistic relation fixing the type of

change between two evaluations хі , Xj of the scale,

RTend(Xt, Xj ) - is the linguistic relation fixing the intensity

of difference between two evaluations Xt, Xj of the scale.

The relation TTend(Xt, Xj) is the fuzzy linguistic relation

which is applied for determination of comparative linguistic

evaluation Vj = TTend( Xj , X j ) which characterizes the

direction of change (increase or decrease) of the value of the

absolute LE Xt with respect to Xj which can be represented

by linguistic expressions, for example, by values from the set {INCREASE, DECREASE, STABILITY, ZERO}. Let us

note that each evaluation Vj= TTend(Xt, Xj) is representable

by its fuzzy set. The relation TTend is antireflexive, antisymmetric and transitive:

V~ є X TTend (~,~) = 0,

V~,~ є X (~ Ф ~) TTend (x, ~)л TTend (~,~ ) = 0 Vx,~ ,x є X TTend (x,x) > TTend (x,~ ) л TTend (~, z ) The stated properties of the relation TTend permit to classify it as an ordering relation. Then the aggregate of all possible evaluations V={vy } forms the fuzzy ordinal scale Sv

=<Name_ TTend, V, X, Gv, Pv>.

The relation RTend (Xt, Xj ) is also the fuzzy linguistic relation applied for determination of comparative linguistic evaluation a = RTend (Xt, X j ) which characterizes the

degree of difference, "non-metric" distance between Xt, Xj

which can be expressed linguistically, for example, by values from the set {BIG, MEDIUM, SMALL, ZERO}. This evaluation ау is also representable by its fuzzy set. The relation RTend is antireflexive and symmetric:

V x є X RTend (x ,x ) = 0,

Vx ,x є XT RTend (x , у )= RTend (x ,x )

The indicated properties of the relation RTend permit to classify it as a relation of difference, with it the aggregate of all possible evaluations A={a¿} forms the fuzzy scale Sa

=<Name_ RTend, A, X, Ga, Pa>.

Thus, the ACL-scale Sx for determination of linguistic evaluations is the two-level scale. At the first level of hierarchy from its universal set the ACL-scale Sx permits to

determine linguistic evaluations Xt for values xsX which

characterize their qualitative aspects. Such linguistic evaluations relate to the class of absolute LE. At the second

level of hierarchy for values Xt and Xj - linguistic

evaluations of their changes (va) which characterize qualitative aspects of differences or "difference of the first order" by scales Sv, Sa. Such linguistic evaluations are related to comparative LE.

Let us consider peculiarities of ACL-scales. The offered linguistic ACL-scale Sx is related to the class of fuzzy evaluation scales which enter into the class of ordinal scales, difference and the degree of difference can be additionally evaluated in it. This property permits to consider the linguistic evaluation ACL-scale Sx as quasi-interval and to determine "evaluation" and "computing" operations for it.

Let us introduce the following "evaluation" operations of the ACL-scale Sx generating linguistic evaluations:

1. The operation of determination of the absolute

linguistic evaluation Xt by the value of characteristic of the object xj being evaluated

Xt =Fuzzy(xj), xj єВ, Xt є X.

2. The operation of determination of the value of

characteristic of the object xj being evaluated by the

absolute linguistic evaluation Xt

xj = DeFuzzy( xi),xj єВ, xt є X.

3. The operation of determination of the type of

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

difference (comparative linguistic evaluation)

V = TTend(Xi, Xj), Xi є X, Xj є X .

The operation TTend is non-commutative.

4. The operation of determination of intensity of

difference (comparative linguistic evaluation)

ay = RTend (x, Xj), Xj є X, xj є X.

The operation RTend is commutative.

Let us determine the aggregate of "computing" operations of the ACL-scale for generated linguistic evaluations:

1. Computing the absolute linguistic evaluation

X j = Comp( Xt, Vjj, aj).

2. The difference of intensities of differences

ajj =Diff(a, aj)

3. The union of intensities of the difference

a,-,- = Union(ai, a).

4. The intersection of intensities of differences

a,j = Inter(ai, a,).

Operations Diff Union, Inter are commutative,

associative, bounded.

Linguistic evaluations received by the indicated linguistic ACL-scale will be used in the next part as semantic-dependent for solution of the problem of summarization of FTS in terms of fuzzy tendencies.

IV. Application of elementary tendency model for

SUMMARIZATION OF FUZZY TIME SERIES

Let us consider the application of the offered ACL-scale in solution of the problem of summarization of FTS as the problem of identification of its general fuzzy tendency.

For this purpose, let us design the hierarchical granular model for the initial time series X={ti,xi}, i G [1,n] , n - is the quantity of members of the series. Let us introduce four levels of granulation forming of which corresponds to the bottom-up approach. The zero level of granules will be represented by fuzzy labels of the initial TS. For forming the granules of the zero level, let us use the "evaluation" operation of the ACL-scale: Xi =Fuzzy(xi).

Let us compare fuzzy tendencies of FTS and information granules which have structural commonness. Let us define the operation of granulation of the first level in the form of the functional ETend forming the granules of elementary fuzzy

tendencies: xi = ETend(xi, xi+1 ), i G [1,n-1] , n - is the

quantity of members of FTS.

The functional ETend generates granules of elementary fuzzy tendencies on the basis of "evaluation" operations TTend and RTend of the ACL-scale and can be realized in the subsystem of fuzzy deduction with rules of the following form:

R1 := IF Xt is A11 AND Xt+1 is A12 THEN vt is B1 AND atis N1

Rm := IF Xt is Am1 AND Xt+1 is Am2 THEN Vt is Bm AND at ¡s Nm

The semantics of rules of realization of the functional ETend is represented in the following table.

8. Ze Go Inc Bi

9. Ze Ex Inc VeBi

10. Bad Ze Dec Sm

11. Bad Sat Inc Sm

12. Bad Go Inc Me

13. Bad Ex Inc Bi

14. Sat Ze Dec Me

15. Sat Bad Dec Sm

16. Sat Go Inc Sm

17. Sat Ex Inc Me

18. Go Ze Dec Bi

19. Go Bad Dec Me

20. Go Sat Dec Sm

21. Go Ex Inc Sm

22. Ex Ze Dec VeBi

23. Ex Bad Dec Bi

24. Ex Sat Dec Me

25. Ex Go Dec Sm

The following abridgements were used in this table: Ze(Zero), Bad(Bad), Sat(Satisfactory), Go(Good), Ex(Excellent) for fuzzy values of FTS; Inc(INCREASE), Dec(DECREASE), St(STABILITY) for values of types of changes; Bi(Big), Me(Medium), Sm(Small) and modifiers, such as Ve(Very), Si(Significantly), No(No) for values of intensity of changes.

Let us define the operation of granulation of the second level in the form of the functional STend forming the granules of local fuzzy tendencies: t, =STend(ri, ts), where t, ts are granules of the first level.

The introduced functional STend is computed as the result of the union of one-type elementary tendencies on the base of the "computing" operation Union of the ACL-scale. Then the union Tj=STend(t, zj is the such fuzzy tendency for which

Vj= v , a, = Union (a, as), fj,j = Hi u ps, the duration Atj =Ati +Ats.

The operation of union of one-type tendencies defines the granules of the second level.

The generalized form of rules of granulation of the second level on the basis of the functional STend has the form:

R11 := IF vt is Inc THEN

IF at is A11 AND at+1 is A12 THEN TInc is B1

Rm1: = IF vt is Inc THEN

IF a. is A , AND at+, is A 2 THEN TInc is B ,

t ml t+1 m2 m ’

R12 := IF vt is Dec TH EN

IF a t is A u AND a t+1 is A12 THEN TDe c is B1

Rm 2: = IF v t is Dec THE N

IF a t is Am1 AND a t+1 is Am 2 THEN TDe c is B m

The semantics of rules of granulation of the second level is given below:

TABLE I. Table of rules

No. of the rule X t X M Vt= TTend( X t, X +1 at=RTend( X t, X t+1)

1. Sat Sat St Ze

2. Go Go St Ze

3. Ex Ex St Ze

4. Bad Bad St Ze

5. Ze Ze St Ze

6. Ze Bad Inc Sm

7. Ze Sat Inc Me

TABLE 2. TABLE OF RULES LEVEL 2

TInc TDec Tend

1 Bi Me Inc

2 VeSm Sm Dec

4 Me Bi Dec

5 Bi Sm Inc

6 Me Sm Inc

7 Sm Me Dec

On the basis of introduced functionals we defined the procedure of summarization of TS as the procedure of identification of the fuzzy tendency Tend.. This procedure is the sequential generation of information granules which model TS at different abstract levels. The result of the procedure of summarization of FTS is the granule of the general fuzzy tendency which is the convolution of elementary tendencies into the linguistic evaluation of behaviour of a FTS:

ETend( X., X.+1 STend(t, tT ^ GTend (t,ts) .

8-

Figure 1. The time series No. 1. The result of summarization procedure: "General tendency =Increase"

Figure 2. The time series No. 2. The result of summarization procedure "General tendency=Decrease".

The offered approach to the solution of the problem of summarization of FTS on the basis of fuzzy tendency model and granular computing was realized as software in the system generating artificial time series with noise. On figures 1, 2 examples of execution of the procedure of summarization of the artificial TS and its results are presented.

References

[1] Batyrshinand L. Sheremetov. Perception Based Time Series Data Mining for Decision Making. /IFSA’07 Theoretical Advances and Applications of Fuzzy Logic, pp.209-219.

[2] N.G. Yarushkina Principles of the theory of fuzzy and hybrid systems: Training aid. - Moscow: Finances and statistics, 2004. - 320 p.

[3] M. §ah ,K.Y. Degtiarev. Forecasting Enrollment Model Based on FirstOrder Fuzzy Time Series. international Conference on Computational intelligence (iCCi) 17-19 December 2004, istanbul, Turkey

[4] Dvorak A.,Novak V.:Formal Theories and Linguistic Description. Fuzzy Sets and Systems, 143(2004), 169-188 .

[5] Song, Q., Chissom, B.S. Fuzzy time series and its models. Fuzzy Sets and Systems , 54 (1993) 269-277

[6] Huarng, K. Heuristic models of fuzzy time series for forecasting. Fuzzy Sets and Systems, 123: 369-386. Young, The Technical Writers Handbook, Mill Valley, CA: University Science, 1989.

[7] L .A. Zadeh. Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic. Fuzzy Sets and Systems , 90(1997), 111-127

i Надоели баннеры? Вы всегда можете отключить рекламу.