Научная статья на тему 'Influence of noise on the DTW metric value in object shape recognition'

Influence of noise on the DTW metric value in object shape recognition Текст научной статьи по специальности «Компьютерные и информационные науки»

CC BY
94
15
i Надоели баннеры? Вы всегда можете отключить рекламу.
Ключевые слова
IMAGE PROCESSING / PATTERN RECOGNITION / METRIC / DTW / NOISES / ОБРАБОТКА ИЗОБРАЖЕНИЙ / РАСПОЗНАВАНИЕ ОБРАЗОВ / МЕТРИКИ / ШУМЫ

Аннотация научной статьи по компьютерным и информационным наукам, автор научной работы — Gostev Ivan M., Sevastianov Leonid A.

The paper sets out one of the methodologies on image processing and recognition of the form of graphic objects. In it, at the first stage preliminary processing of the image with the purpose of extracting of characteristic attributes of the form of objects is made. Contours of objects are used as such attributes. For transformation of 2D contours of objects to one-dimensional contour function ArcHeight method has been used. The algorithm for identification contour functions based on metrics DTW is developed. Definition of the identification function based on this method is introduced. Features of application of metrics DTW are stated at identification of the form of objects. Matrices of distances of combinations the sample-sample and the sample-not sample are presented. Results of calculations of metrics DTW on a plenty of real data are analyzed. It is shown, that the developed algorithm allows to identify the form of objects independently of their position and an angle of turn on the image. Influence of the noise imposed on the image of object, on value of the metrics is investigated. Theoretical and practical results of such dependence are received; it shows that in a wide range (up to the ratio a signal/noise 10 dB) value of the metrics practically does not change. The positive parties and lacks of the offered algorithm are noted at identification of the form of object.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

О влиянии шумов на значение метрики DTW при идентификации формы объектов

В работе изложена одна из методологий по обработке изображений и распознавания формы графических объектов. В ней на первом этапе производится предварительная обработка изображения с целью выделения характерных признаков формы объектов. В качестве таких признаков были использованы контуры. Для преобразования 2D контуров объектов в одномерную контурную функцию был использован метод ArcHeight. Для идентификации контурных функций разработан алгоритм на основе метрики DTW. Введено определение идентификационной функции, основанной на этом методе. Изложены особенности применения метрики DTW при идентификации формы объектов. Приведены матрицы расстояний комбинаций эталон-эталон и эталон-неэталон. Проанализированы результаты вычислений метрики DTW на большом количестве реальных данных. Показано, что разработанный алгоритм позволяет идентифицировать форму объектов независимо от их положения и угла поворота на изображении. Исследовано влияние шумов, наложенных на изображение объекта, на значение метрики. Получены теоретические и практические результаты такой зависимости, которые показывают, что в широком диапазоне (до отношения сигнал/шум 10 дБ) значение метрики практически не изменяется. Отмечены положительные стороны и недостатки предложенного алгоритма при идентификации формы объекта.

Текст научной работы на тему «Influence of noise on the DTW metric value in object shape recognition»

#

Вестник РУДН. Серия МИФ

RUDN Journal of MIPh

http://journals.rudn.ru/miph

2018 Vol. 26 No. 4 331-342

UDC 519.683.8:519.6

DOI: 10.22363/2312-9735-2018-26-4-331-342

Influence of Noise on the DTW Metric Value in Object Shape

Recognition

Ivan M. Gostev*, Leonid A. Sevastianov^

* Department of Management Information Systems and Digital Infrastructure

National Research University "Higher School of Economics" 20, Myasnickaya str., Moscow, 101000, Russian Federation

^ Department of Applied Probability and Informatics Peoples' Friendship University of Russia (RUDN University) 6, Miklukho-Maklaya str., Moscow, 117198, Russian Federation

The paper sets out one of the methodologies on image processing and recognition of the form of graphic objects. In it, at the first stage preliminary processing of the image with the purpose of extracting of characteristic attributes of the form of objects is made. Contours of objects are used as such attributes. For transformation of 2D contours of objects to one-dimensional contour function ArcHeight method has been used. The algorithm for identification contour functions based on metrics DTW is developed. Definition of the identification function based on this method is introduced. Features of application of metrics DTW are stated at identification of the form of objects. Matrices of distances of combinations the sample-sample and the sample-not sample are presented. Results of calculations of metrics DTW on a plenty of real data are analyzed. It is shown, that the developed algorithm allows to identify the form of objects independently of their position and an angle of turn on the image. Influence of the noise imposed on the image of object, on value of the metrics is investigated. Theoretical and practical results of such dependence are received; it shows that in a wide range (up to the ratio a signal/noise 10 dB) value of the metrics practically does not change. The positive parties and lacks of the offered algorithm are noted at identification of the form of object.

Key words and phrases: image processing, pattern recognition, metric, DTW, noises

At identification of graphic objects there is a number of problems that need to be investigated first. One of such problems is the influence of noises on the quality of recognition. The use of methodology described in Refs. [1,2] is oriented on multiple stages of preliminary image processing aimed at extracting primary image features, such as contours, angles, texture, etc. [3,4]. At the final stage, the object properties are compared with the sample object properties basing on a certain metric that allows the classification of objects. The object contour is known to be one of the most informative features of the object shape [5,6]. Although the issues of contour receiving and mathematical description attract much attention, a significant role here is played by the recognition methods using signature analysis [7,8]. It is based on the calculation of one-dimensional contour function from the two-dimensional array of the contour points with the use of the center of gravity of the studied object or, in some cases, without it. The process of identifying such one-dimensional contour function is identical to the analysis and comparison of time series data, since such contour function can be considered as an example of time series.

Time series data are widely used in many scientific and practical areas, such as recognition of speech and music, hand-written text, electrocardiograms, stock exchange quotations, in bioinformatics, etc. The use of Euclidean metric (as well as the Lp metrics) for their classification often leads to large errors.

In the present paper in order to identify the object shape after the preliminary processing of the image we use the algorithm ArcHeight, one of the signature analysis algorithms, by using which one gets a contour function considered further as a time series. We briefly present the principles of the classical algorithm DTW that underlies the comparison of

1. Intro duction

Received 27th October, 2018.

time series data. The influence of noises on the value of DTW metric is theoretically analyzed. The results of simulations confirm the theoretical conclusion that the metric value is independent of noise values in a wide range of interval. Examples that show the area of applicability of the present method are presented. In the discussion, the advantages and disadvantage of the presented approach to the identification of contour functions are summarized. In conclusion, the direction of further studies in the field and the perspective of application are outlined.

2. Contour Function

Let us define a contour function as the result of a certain transformation of the 2d form (contour) object into a 1d function of the boundary representation. Most methods of constructing such functions are based on the signature analysis that uses the representation of the contour function with respect to the center of gravity [9]. There are also methods that use contours, but do not use the center of gravity, which can be also considered as signature methods, e.g., the algorithms by Chetverikov [10], Rosenfeld [11], Mokhtarian [12] or the arc-chord distance method [13,14], later referred to as ArcHeight [15].

In the present paper the contour function of the object is obtained using the classical algorithm ArcHeight. This algorithm is based on calculating the length of the perpendicular from the middle of some segment of the curve (the object perimeter) to the chord connecting the end points of the segment. This perpendicular is calculated using the formula

di =

J

((Xj - Xa)(yh - y a) - (yi - ya)(Xb - Xa))2

(xa - xb)2 + (ya - yb)2

(1)

and is proportional to the value of curvature of the segment at the considered point [16].

The sequential tracing of the contour yields a certain tabulated function that characterizes the shape of the object. An example of such function for the silhouette of the aircraft JetCommander1121 is shown in Fig. 1.

FileN=JetCommander1121.jpg

NPoint=2065

500 450 400 350 300 250 200 150 100 50

0

200

400

600

HordLen=35 S.Golay.Num=31

u

200 400 600 800 1000 1200 1400 1600 1800 2000

Figure 1. Silhouette of the aircraft JetCommander1121 with introduced noise at

the level of 40 dB, its contour and its contour function calculated us ing the method ArcHeight. The number of contour points is 2064, the chord length is 35, the Savitzky-Golay filter length is 31 point

0

In the left upper corner of Fig. 1 the silhouette is shown, the right upper corner shows the contour of this, and at the bottom the resulting contour function, obtained using the ArcHeight method is presented. To obtain the contour function we used the following sequence of operations: image loading, filtering, conversion into binary black-and-white image, morphological cleaning of the residual noise, contour receiving, using of the algorithm ArcHeight for the contour function calculation, and its smoothing using Savitzky-Golay method.

Since the size of the objects on the image can be different when applying the ArcHeight method, we used the technique of dynamical setting of the curve segment size to eliminate the size influence. Experiments have shown that the optimal value of the chord is L = K/100 + 14, where L is the segment length and K is the total length of the contour in pixels.

Remark. To enhance the noise immunity of the method for large objects it is possible to use averaged values of coordinates of a few adjacent points rather than a single point as the end coordinates of the chord.

3. The Idea of DTW

Let us denote the contour functions of the object and the sample as the sequences of values Q = q1 ,q2,... ,qi,... ,qn and C = c1,c2,... ,Ci,... cm, respectively, generally with n = m, and let us consider these functions to be time series data, so that the adjacent values of each series have similar time shifts. For comparing these sequences in the classical DTW method the matrix of distances (deformations) D(i,j) with the dimensions m x n is constructed, where each element (i,j) of the matrix represents a certain distance d(q, c) between two points qi and Ci. Commonly this distance is calculated as a Euclidean one, i.e.,

= - c?)1/2, (2)

but it can be calculated as the norm d(qi,Ci) = — q| as well. Each element of the matrix corresponds to the flattened section between the points qi and Ci. The purpose of the DTW algorithm is to construct a certain path W = w1,w2,... ,Wk,..., wk, such that max(m, n) < K < m, + n + 1, obeying the following conditions:

- the boundary condition: for the initial and final points of the matrix diagonal w1 = (1,1) and wk = (m, n);

- the continuity condition: for the adjacent points Wk = (a,b) and Wk = (a',b') it is necessary that (a — d ) < 1 and (b — b' ) < 1;

- the monotonicity condition: for the adjacent points and it is necessary that (a—a') > 0 and (b — b') > 0.

From the theoretical variety of possible paths, the following one is chosen:

DWT (Q,C ) = min

A

K

/K I . (3)

fc=i

In Eq.(3), the divisor K is used to normalize the metric at different dimensions of the sequences Q and C. This path is found basing on dynamical programming by means of the formula

l(i,j) = d(qi,cj) +min{7(i - 1,j - V),^(i - 1,j),^(i,j - ^^ (4)

where ) is the accumulated path length and d(qi,Cj) is the path length from the starting point to the point (i,j). As a result, a certain number is obtained characterizing the distance between the sequences Q and C, which will be denoted by pdtw.

Figure 2 presents three matrices of distance.

50 100 150 200 50 100 150 200 0 40 80 120 160 200

Figure 2. Three matrices of distance and the calculated paths for comparison of objects: EE (left); EI2 with distortion of one of the objects (middle), EN (right)

The left-hand one results from the comparison of two identical sequences SS (Sample-Sample), the middle one is obtained for a certain variation of one sequence with respect to ano5her, and the right-hand one is obtained for different sequences SN (Sample-Non-Sample). The diagonal top-bottom line corresponds to the optimal path. The smaller is the deviation of this line from the diagonal, the smaller is the difference between the se2q00uences. 80 80

Let us consider the problem of whether the minimal DTW path satisfies the definition of metric. For this aim, let us check the fulfilment of the metric requirements:

— the axiom of identity d(x, y) ^ 0, the equality to zero corresponding to x = y. Since the elements of the matrix of distances are calculated in an Euclidean space, it is obvious that each element of the matrix will be positive and, therefore, the condition is satisfied. When the sequences coincide, the distance along the principal diagonal will equal 0, since the Euclidean metric, calculated for each element of the diagonal, will be equal to zero;

— the axiom of symmetry d(x, y) = d(y, x). This condition is also obviously valid, since the permutation of positions in the metric means the transposition of the matrix of distances;

— the triangle inequality d(x, z) ^ d(x, y) + d(y, z). Automatically follows from 1 and 2 being presented as d(x, y) ^ d(x, z) + d(y, z).

Thus, the use of the term "DTW metric" is fully justified.

Let us define the identification function, based on the DTW method, as

A =i1, Pdtw<0£' (5)

[0, pdtw ^ e.

where pdtw is the DTW metric, and £ is the classification tolerance (CT) for the considered method.

Let us analyze some result2s00of DTW metric calculations for realistic data. As an example, consider the contour functions of aircraft silhouettes presented in Fig. 31.

1Such objects were chosen on purpose, since the silhouettes of the planes do not differ strongly, so it

is interesting to classify such objects (i.e., to divide them into classes).

■f-jH* "H-fr

Figure 3. Silhouettes of aircrafts1 (from top to bottom, from left to right) Grumman TBF-1, Aero L-29, Arado Ar 234B, Curtiss P-40F, Dornier DO-335A, Grumman F4F-3, Arado Ar 240, Avro 668, Bell-X1, Jet Commander 1121, Mitchell-D-25C, Boeing B-47E (strongly reduce)

Table 1 presents the values of metrics calculated for all combinations of aircraft silhouettes in Fig. 3. At the principal diagonal, the values of DTW metric calculated for the aircrafts of the same name are located. For combinations of different planes (SN) we get the mean value ^ = 0.639 and the variance a = 0.045, which allows the conclusion that the mean value for sample-sample combinations is essentially smaller than for sample-non-sample combinations. This is an evidence of DTW metric efficiency in the object shape recognition.

One more issue of importance is the type of metric distribution under different distortions of the contour function. The simulation has shown that the distribution of the DTW metric value under the influence of noise occurs in accordance with the normal law. Thus, e.g., Fig. 4 presents a histogram of the metric values and the distribution density function calculated on its base for the Avro668 plane SS combinations. The metric value is different from zero, since in the process of investigation a noise was added on the plane silhouette that distorted the shape of the object and, finally, the contour function.

Data

Figure 4. Histogram and metric distribution density function for the Avro668

plane SS combinations

Table 1

DTW Metrie Values

Grumman TBF-1 Aero L-29 Arado Ar234E Arado Ar240 Avro 668 Bell-Xl Boeing B47E Curtiss P-40F Dornier D0335A Grumman F4F-3 JetCom-mander-1121 Mitchell-D-25C

Grumman TBF-1 0.000 0.533 1.116 0.927 0.342 0.504 0.480 0.480 0.526 0.509 0.556 0.923

Aero L-29 0.533 0.000 0.804 0.741 0.431 0.531 0.590 0.467 0.488 0.565 0.544 0.727

Arado Ar234B 1.116 0.804 0.000 0.688 0.998 0.794 1.214 0.941 0.777 0.765 0.953 0.559

Arado Ar240 0.927 0.741 0.688 0.000 0.864 0.700 1.026 0.944 0.789 0.780 0.843 0.698

Avro 668 0.342 0.431 0.998 0.864 0.000 0.418 0.361 0.313 0.388 0.321 0.475 0.764

Bell-Xl 0.504 0.531 0.794 0.700 0.418 0.000 0.563 0.485 0.429 0.495 0.568 0.641

Boeing B47E 0.480 0.590 1.214 1.026 0.361 0.563 0.000 0.479 0.475 0.507 0.586 0.972

Curtiss P-40F 0.480 0.467 0.941 0.944 0.313 0.485 0.479 0.000 0.344 0.410 0.607 0.778

Dornier D0335A 0.526 0.488 0.777 0.789 0.388 0.429 0.475 0.344 0.000 0.355 0.588 0.655

Grumman F4F-3 0.509 0.565 0.765 0.780 0.321 0.495 0.507 0.410 0.355 0.000 0.569 0.628

Jet Com- mander 1121 0.556 0.544 0.953 0.843 0.475 0.568 0.586 0.607 0.588 0.569 0.000 0.882

Mitchell-D-25C 0.923 0.727 0.559 0.698 0.764 0.641 0.972 0.778 0.655 0.628 0.882 0.000

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

The verification of the hypothesis of normal distribution has shown high accuracy of agreement with the parameters n = 0.1925127; a = 0.001491. Multiple experiments with other silhouettes also confirm the normal distribution law for the values of metrics under the effect of noise.

4. Specific Features of DTW Metric Application

Under process identification real objects, their size and position with respect to the sample are not known in advance. Therefore, it is necessary to use so me algorithmic method for correct matching of contour functions in the process of recognition.

Moreover, even for the rotation of a single objecS the contour function shifts by a certain number of points cocresponding to the rotation angle. An example of such contour functions with the cyclic shift by nearly (540 points for the silhouette of BoeingB47E is shown i n Fig. 5.

200 400 600 800 1000 1200

Figure 5. BoeingB47E aircraft silhouettes (reduce) with different angle of rotation and their contour functions with cyclic shift

0

Calculating the DTW metric for cyclically shifted contour functions from Fig. 6, we get the value patw = 0.7174, which corresponds to the SN combination for the compared contour functions . Since the orientation of the object silhouette, i.e., the shift of one contour function with re spect to another is not known a priory, some operations are to be performed to resolve the problem. Ffrst, assume that the number of points of the sample exceeds that of the object to be recognized, e.e., m > n. Otherwise, we change the object for the sample and vice verse. Socond, it is necessary to augment the sample from the right by the number of points, corresponding to t lie object , i.e., C* = c1 ,c2,... ,cm,c1 ,c2,... ,cn. Third, it is needed to organize the sequential comparison of the object with the fragment of the augmented sample. In other words, we calculate the DTW as a function of the object position on the augmented function Cp of the sample in the

form FDTW(Q,Cp(i)) = min (1 wk/K^j , where Cp(i) = C*(ct,c*+i,... ,c*+»-i),

i = 1, n. Here Cp at a certain i determines the value of shift of the window moved along the sample, in which the values of the DTW metric is calculated. The metric value will be calculated as pdtw = min (FDTW(Q,Cp(i))).

5. Noise Resistance

In the development of new methods for identification of the object shape one has to take the influence of different factors into account [17]. One of them is the permanent presence of noises in the real image, which finally distorts the contour function. Let us estimate the influence of noises on the DTW metric value.

Let the contour function be presented as Q = q1,q2,... ,qi,... ,qn for the object and as C = qi + i = 1, n for the sample, where is a certain value of the noise component distributed according to the normal law and added to each value of the numerical sequence. Let us substitute the values of C into Eq. (2) and calculate the distance for each value of i:

d(qi - Ci) = (qi - (qi - £¿))2 , i - 1,n. (6)

In fact, the result of the calculation means that under the influence of noises the value of metric converges to the %2 law [18]1. According to the central limit theorem, when i converges to infinity for a large number of degrees of freedom k, the distribution of the random variable Y ^ %2 (k) can be approximated by the normal distribution, with parameters Y = N(k, 2k), with k ^ < (more exactly, ^ N(0,1)). Therefore, the

value of metric in the sum of Eq. (6) for a large number point of sequence must tend to zero. Hence, the value of metric should not change under the addition of noises to the contour function in a wide range of the values, determining the shape of objects.

Let us verify this statement by simulation using the Matlab software. We impose random noise on the contour function (series), e.g., the one obtained from the silhouette of the AeroL29 aircraft, using the built-in function awgn that allows the addition of noises to an array of points with the level expressed in dB2. Let us calculate the values of metric using Eq. (4) with Eq. (6) taken into account, starting from the signal-to-noise ratio 100 dB and gradually reducing it to 3 dB with the step of 1 dB. The simulation results for three silhouettes of AeroL29, JetCommander1121, and MitchellD25C shown in Fig. 6 demonstrate that the metric remains unchanged practically until the noise level becomes comparable with the signal level.

0.18 -1-1-1-1-1-1-1-1-1-

0.16 -

Signal/Noise (dB)

Figure 6. Metric values versus tlie noise level for AeroL29 (solid line), JetCommander1121 (dashed line), and MitchellD25C (dash-dotted line)

Basing on the considerations presented above, we conclude that the DTW metric is persistent against the noise in a wide range of values of noise.

1I n this paper it is shown that t lie image noises always lead to the normal law of distribution for metric.

2Due to the normal distribution of the noise component, the addition of noiie to ehe contour function is equivalent to the addition of noise to the image with its subsequent processing and generation of contour function.

6. Discussion and Conclusion

The DTW metric offers wide possibilities of object shape recognition. The disadvantage of the algorithm include high computation complexity of calculating the metric O(mn) and, as a consequence, essential increase of the computation time for sequences of high dimensions.

High potentialities are related to the application of DTW metric to the recognition of graphical objects shape. The capabilities of DTW allow the identification of objects, for which the description is based on the nonlinear variation of the contour function step that occurs in the case of object shape distortion. In these cases, the metrics of the Lp type are inapplicable. The analysis of the DTW metrics demonstrates its resistance against the noise, which significantly extends the area of its application. Besides the identification of the object shapes, this metric is widely used in speech recognition, in medicine, bioinformatics, and economics in the analysis of trends, where the parasitic noise level can be very high, etc.

Acknowledgments

The work was supported by the Russian Foundation for Basic Research, grant No. 1607-00556.

References

1. M. Seul, L. O'Gorman, M. Sammon, Practical Algorithms for Image Analysis, Cambridge University Press, 2000.

2. W. K. Pratt, Digital Image Processing (Fourth edition), Wiley, 2007.

3. R. O. Duda, P. E. Hart, Pattern Classification and Scene Analysis, Wiley, 1973.

4. R. Gonzalez, R. Woods, Digital Image Processing, Addison-Wesley Publishing Company, Reading, 1992.

5. D. Marr, Vision: A Computational Investigation Into the Human Representation and Processing of Visual Information, Published March 15th 1983 by W. H. Freeman, 1983.

6. R. Klette, Digital Geometry: Geometric Methods for Digital Image Analysis Usa, Morgan Kaufmann, 2004.

7. K. William, Pratt Digital image processing, Wiley publication, 1978.

8. S. Mark, S. Nixon Alberto, Aguado Feature Extraction & Image Processing for Computer Vision Third edition, Elsevier, 2012.

9. D. H. Ballard, C. M. Brown, Computer Vision, Prentice-Hall. Inc., 1982.

10. D. Chetverikov, A Simple and Efficient Algorithm for Detection of High Curvature Points in Planar Curves, Vol. 2756, Springer, Berlin, Heidelberg, 2003, pp. 746-753. doi:10.1007/978-3-540-45179-2_91.

11. A. Rosenfeld, E. Johnston, Angle Detection on Digital Curves, IEEE Transactions on Computers C-22 (1973) 875-878. doi:10.1109/TC.1973.5009188.

12. F. Mokhtarian, A. K. Mackworth, A Theory of Multiscale, Curvature-Based Shape Representation for Planar Curves, IEEE Transactions on Pattern Analysis and Machine Intelligence 14 (1992) 789-805. doi:10.1109/34.149591.

13. U. Ramer, An Iterative Procedure for the Polygonal Approximation of Plane Closed Curves, Computer Graphics Image Processing 1 (1972) 244-256. doi:10.1016/S0146-664X(72)80017-0.

14. T. Y. Phillips, A. Rosenfeld, A Method for Curve Partitioning Using Arc-Chord Distance, Pattern Recognition Letters 5 (1987) 285-288. doi:10.1016/0167-8655(87)90059-6.

15. Y. Lin, J. Dou, H. Wang, Contour Shape Description Based on an Arch Height Function, Journal Pattern Recognition 25 (1992) 17-23. doi:10.1016/0031-3203(92)90003-2.

16. J. H. Han, T. Poston, Chord-to-Point Distance Accumulation and Planar Curvature: a New Approach to Discrete Curvature, Pattern Recognition Letters 22 (2001) 1133— 1144. doi:10.1016/S0167-8655(01)00063-0.

17. H. Ding, G. Trajcevski, P. Scheuermann, X. Wang, E. J. Keogh, Querying and mining of time series data: experimental comparison of representations and distance measures, PVLDB 1 (2008) 1542-1552. doi:10.14778/1454159.1454226.

18. I. M. Gostev, About Modelling and Estimation of Classification Tolerance, Bulletin of Peoples' Friendship University of Russia. Series: Applied and Computer Mathematics 3 (1) (2004) 85-92, in Russian.

УДК 519.683.8:519.6

DOI: 10.22363/2312-9735-2018-26-4-331-342

О влиянии шумов на значение метрики DTW при идентификации формы объектов

И. М. Гостев*, Л. А. Севастьянов^

* Кафедра управления информационными системами и цифровой инфраструктурой Национальный исследовательский университет «Высшая школа экономики» ул. Мясницкая, д. 20, Москва, Россия, 101000 t Кафедра прикладной информатики и теории вероятностей Российский университет дружбы народов ул. Миклухо-Маклая, д. б, Москва, Россия, 11719S

В работе изложена одна из методологий по обработке изображений и распознавания формы графических объектов. В ней на первом этапе производится предварительная обработка изображения с целью выделения характерных признаков формы объектов. В качестве таких признаков были использованы контуры. Для преобразования 2D контуров объектов в одномерную контурную функцию был использован метод ArcHeight. Для идентификации контурных функций разработан алгоритм на основе метрики DTW. Введено определение идентификационной функции, основанной на этом методе. Изложены особенности применения метрики DTW при идентификации формы объектов. Приведены матрицы расстояний комбинаций эталон-эталон и эталон-неэталон. Проанализированы результаты вычислений метрики DTW на большом количестве реальных данных. Показано, что разработанный алгоритм позволяет идентифицировать форму объектов независимо от их положения и угла поворота на изображении. Исследовано влияние шумов, наложенных на изображение объекта, на значение метрики. Получены теоретические и практические результаты такой зависимости, которые показывают, что в широком диапазоне (до отношения сигнал/шум 10 дБ) значение метрики практически не изменяется. Отмечены положительные стороны и недостатки предложенного алгоритма при идентификации формы объекта.

Ключевые слова: обработка изображений, распознавание образов, метрики, DTW, шумы

Литература

1. Seul M., O'Gorman L., Sammon M. Practical Algorithms for Image Analysis. — Cambridge University Press, 2000. — 295 p.

2. Pratt W. K. Digital Image Processing (Fourth edition). — Wiley, 2007. — 807 p.

3. Duda R. O, Hart P. E. Pattern Classification and Scene Analysis. — Wiley, 1973.

4. Gonzalez R., Woods R. Digital Image Processing. — Addison-Wesley Publishing Company, Reading, 1992. — 191 p.

5. Marr D. Vision: A Computational Investigation Into the Human Representation and Processing of Visual Information. — Published March 15th 1983 by W. H. Freeman, 1983. — 397 p.

6. Klette R. Digital Geometry: Geometric Methods for Digital Image Analysis Usa. — Morgan Kaufmann, 2004.

7. William K. Pratt Digital image processing. — Wiley publication, 1978. — 750 p.

8. Mark S., Nixon Alberto S. Aguado Feature Extraction & Image Processing for Computer Vision Third edition. — Elsevier, 2012. — 622 p.

9. Ballard D. H., Brown C. M. Computer Vision. — Prentice-Hall. Inc., 1982. — 548 p.

10. Chetverikov D. A Simple and Efficient Algorithm for Detection of High Curvature Points in Planar Curves // Computer Analysis of Images and Patterns. CAIP 2003. Lecture Notes in Computer Science / Ed. by N. Petkov, M. A. Westenberg. — Berlin, Heidelberg: Springer, 2003. — Vol. 2756. — Pp. 746-753. — DOI: 10.1007/978-3-540-45179-2_91.

11. Rosenfeld A., Johnston E. Angle Detection on Digital Curves // IEEE Transactions on Computers. — 1973. — Vol. C-22, issue 9. — Pp. 875-878. — DOI: 10.1109/TC.1973.5009188.

12. Mokhtarian F., Mackworth A. K. A Theory of Multiscale, Curvature-Based Shape Representation for Planar Curves // IEEE Transactions on Pattern Analysis and Machine Intelligence. — 1992. — Vol. 14, issue 8. — Pp. 789-805. — DOI: 10.1109/34.149591.

13. Ramer U. An Iterative Procedure for the Polygonal Approximation of Plane Closed Curves // Computer Graphics Image Processing. — 1972. — Vol. 1, issue 3. — Pp. 244-256. — DOI: 10.1016/S0146-664X(72)80017-0.

14. Phillips T. Y., Rosenfeld A. A Method for Curve Partitioning Using Arc-Chord Distance // Pattern Recognition Letters. — 1987. — Vol. 5, issue 4. — Pp. 285-288. — DOI: 10.1016/0167-8655(87)90059-6.

15. Lin Y., Dou J., Wang H. Contour Shape Description Based on an Arch Height Function // Journal Pattern Recognition. — 1992. — Vol. 25, issue 1. — Pp. 17-23. — DOI: 10.1016/0031-3203(92)90003-2.

16. Han J. H., Poston T. Chord-to-Point Distance Accumulation and Planar Curvature: a New Approach to Discrete Curvature // Pattern Recognition Letters. — 2001. — Vol. 22, issue 10. — Pp. 1133-1144. — DOI: 10.1016/S0167-8655(01)00063-0.

17. Querying and mining of time series data: experimental comparison of representations and distance measures / H. Ding, G. Trajcevski, P. Scheuermann et al. // PVLDB. — 2008. — Vol. 1, issue 2. — Pp. 1542-1552. — DOI: 10.14778/1454159.1454226.

18. Гостев И. М. О моделировании и оценке классификационного допуска // Вестник РУДН. Серия: Прикладная и компьютерная математика. — 2004. — Т. 3, № 1. — С. 85-92.

© Gostev Ivan M., Sevastianov Leonid A., 2018

ED

This work is licensed under a Creative Commons Attribution 4.0 International License Для цитирования:

Gostev I. M., Sevastianov L. A. Influence of Noise on the DTW Metric Value in Object Shape Recognition // RUDN Journal of Mathematics, Information Sciences and Physics. — 2018. — Vol. 26, No 4. — Pp. 331-342. — DOI: 10.22363/2312-9735-2018-26-4-331-342.

For citation:

Gostev I. M., Sevastianov L. A. Influence of Noise on the DTW Metric Value in Object Shape Recognition, RUDN Journal of Mathematics, Information Sciences and Physics 26 (4) (2018) 331-342. DOI: 10.22363/2312-9735-2018-26-4-331-342.

Сведения об авторах:

Гостев Иван Михайлович — доктор технических наук, профессор кафедры управления информационными системами и цифровой инфраструктурой Национального исследовательского университета «Высшая школа экономики» (e-mail: igostev@hse. ru, тел.: +7(495)7729590)

Севастьянов Леонид Антонович — профессор, доктор физико-математических наук, профессор кафедры прикладной информатики и теории вероятностей РУДН (e-mail: sevastianov-la@rudn.ru, тел.: +7(495)9522572)

Information about the authors:

Gostev Ivan M. — Doctor of Technical Sciences, professor of Department of Information Systems and Digital Infrastructure Management of National Research University "Higher School of Economics" (e-mail: igostev@hse.ru, phone: +7(495)7729590) Sevastianov Leonid A. — Professor, Doctor of Physical and Mathematical Sciences, Professor of Department of Applied Probability and Informatics of Peoples' Friendship University of Russia (RUDN University) (e-mail: sevastianov-la@rudn.ru, phone: +7(495)9522572)

i Надоели баннеры? Вы всегда можете отключить рекламу.