Научная статья на тему 'Acceleration of summation over segments using the fast Hough transformation pyramid'

Acceleration of summation over segments using the fast Hough transformation pyramid Текст научной статьи по специальности «Математика»

CC BY
134
14
i Надоели баннеры? Вы всегда можете отключить рекламу.
Ключевые слова
SEARCH FOR SEGMENTS / FAST HOUGH TRANSFORMATION / DISCRETE RADON TRANSFORMATION / BRADY ALGORITHM / FAST DISCRETE RADON TRANSFORMATION / DYADIC PATTERN / BEAMLET PYRAMID / ПОИСК ОТРЕЗКОВ / БЫСТРОЕ ПРЕОБРАЗОВАНИЕ ХАФА / ДИСКРЕТНОЕ ПРЕОБРАЗОВАНИЕ РАДОНА / АЛГОРИТМ БРЕЙДИ / БЫСТРОЕ ДИСКРЕТНОЕ ПРЕОБРАЗОВАНИЕ РАДОНА / ДИАДИЧЕСКИЙ ПАТТЕРН / БИМЛЕТ-ПИРАМИДА

Аннотация научной статьи по математике, автор научной работы — Soshin K.V., Nikolaev D.P., Gladilin S.A., Ershov E.I.

In this paper, we propose an algorithm for fast approximate calculation of the sums over arbitrary segments given by a pair of pixels in the image. Using the results of intermediate calculations of the fast Hough transform, the proposed algorithm allows to calculate the sum over arbitrary line segment with a logarithmic complexity depending on the linear size of the original image (also called fast discrete Radon transform or Brady transform). In this approach, the key element of the algorithm is the search for the dyadic straight line passing through two given pixels. We propose an algorithm for solving this problem that does not degrade the general asymptotics. We prove the correctness of the algorithm and describe a generalization of this approach to the three-dimensional case for segments of straight lines and of planes.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

УСКОРЕНИЕ СУММАЦИИ ПО ОТРЕЗКАМ С ИСПОЛЬЗОВАНИЕМ ПИРАМИДЫ БЫСТРОГО ПРЕОБРАЗОВАНИЯ ХАФА

В работе предложен алгоритм быстрого приближенного вычисления на изображении сумм по произвольным отрезкам, задаваемым парой пикселей. Используя результаты промежуточных вычислений быстрого преобразования Хафа, предложенный алгоритм позволяет рассчитать сумму по произвольному отрезку линии с логарифмической сложностью, зависящей от линейного размера исходного изображения. Предподсчет реализован как модификация алгоритма Брейди быстрой суммации по диадическим аппроксимациям прямых. При таком подходе ключевым элементом алгоритма является поиск диадической прямой, проходящей через два данных пикселя. В работе предложен алгоритм решения этой задачи, не ухудшающий общую асимптотику, для него доказана корректность. Также в работе описывается обобщение этого подхода на трехмерный случай для отрезков и для сегментов плоскостей.

Текст научной работы на тему «Acceleration of summation over segments using the fast Hough transformation pyramid»

MSC 65D18

DOI: 10.14529/ mmp200110

ACCELERATION OF SUMMATION OVER SEGMENTS USING THE FAST HOUGH TRANSFORMATION PYRAMID

K.V. Soshin1'2, D.P. Nikolaev2, S.A. Gladilin2, E.I. Ershov2

1Moscow Institute of Physics and Technology, Dolgoprudny, Russian Federation 2Institute for Information Transmission Problems (Kharkevich Institute), Moscow, Russian Federation

E-mails: soshin.kv@phystech.edu, dimonstr@iitp.ru, gladilin@iitp.ru, ershov@iitp.ru

In this paper, we propose an algorithm for fast approximate calculation of the sums over arbitrary segments given by a pair of pixels in the image. Using the results of intermediate calculations of the fast Hough transform, the proposed algorithm allows to calculate the sum over arbitrary line segment with a logarithmic complexity depending on the linear size of the original image (also called fast discrete Radon transform or Brady transform). In this approach, the key element of the algorithm is the search for the dyadic straight line passing through two given pixels. We propose an algorithm for solving this problem that does not degrade the general asymptotics. We prove the correctness of the algorithm and describe a generalization of this approach to the three-dimensional case for segments of straight lines and of planes.

Keywords: search for segments; fast Hough transformation; discrete Radon transformation; Brady algorithm; fast discrete Radon transformation; dyadic pattern; beamlet pyramid.

Dedicated to Professor V.L. Arlazarov in honour of his anniversary.

Introduction

In the paper [1], D.L. Donoho and coauthors propose an approach to multiscale image analysis called the beamlet analysis. The approach allows to construct detectors of objects (segments having unknown position, length and orientation; circles having unknown position and diameter, etc.) on highly noisy images [2]. From a practical point of view, the proposed approach has a drawback, since a complex pre-calculation is necessary. Indeed, for an input image, all beamlet analysis algorithms assume that the beamlet dictionary is pre-calculated as a set of segments having various orientations, while the position and length of the segments are chosen in a special way. However, the authors do not discuss the computational complexity of constructing such a dictionary and do not propose algorithms to calculate the dictionary. Therefore, it is difficult to estimate the practical applicability of the approach as a whole. The paper is devoted to the issues of the pre-calculation complexity, as well as the fast finding of the sums over arbitrary segments in the image as a whole.

Let us describe the approach proposed by D.L. Donoho and coauthors in more detail. The approach involves construction of a dyadically organized beamlet dictionary. The authors consider an image as a function of two variables defined on a discrete lattice in the square [0,1]2 with the discretization step 1/n, where n = 2p, p G N, and the length of the segment is defined as a number of pixels formed the segment. Then the dyadic square

S(k1, k2, q) is the set of points [k1/2q, (k1 + 1)/2q] x [k2/2q, (k2 + 1)/2q] on the plane. Here q defines the level of the dyadic square, and (k1; k2) define the position of the square, where 0 < q < p and 0 < k1,k2 < 2q — 1. In particular, the whole image is the dyadic square S(0, 0, 0) of the level 0, and an arbitrary pixel having the coordinates (i,j) is the dyadic square S(i,j,p) of the level p. A beamlet is a segment having arbitrary orientation whose ends belong to the sides of the same dyadic square. The coordinates of the beamlet ends are also given with a certain discretization, and the discretization quantum can be taken to be equal to or smaller than the size of a pixel. In the present paper, we consider only the first case, while the subpixel discretization is not of our interest.

By a beamlet transformation we mean an integration of an image over all possible beamlets in the dictionary. For an image of the size nxn pixels, the beamlet transformation allow to construct a beamlet pyramid containing 6(n2 log n) values. The beamlet pyramid is the basis for the approach to image analysis proposed by D.L Donoho and coauthors [2], and the issue of calculation of the beamlet pyramid is considered, in particular, in the present paper.

The most obvious way to calculate the sum over pixels of a segment is to sum the values of these pixels. Further, we refer to an algorithm based on this obvious way as the "naive" algorithm. If segments have a significant intersection (such a case is typical for the beamlet dictionary), then the "naive" algorithm is ineffective, since the common sub-sums are calculated many times. For example, the "naive" algorithm for calculating the beamlet transformation requires 0(n3) operations, which is too much to use in real time and on mobile platforms. However, D.L Donoho and coauthors refuse to consider the issue of fast calculation of the beamlet transformation and assume such a result to be already known for a given image. At the same time, the authors list several studies aimed at fast calculation of the discrete Radon transformation, which is similar to the beamlet transformation. One of these approaches is the algorithm of the fast discrete Radon transformation. The algorithm was proposed by P. Brady [3] and is also known as the fast Hough transformation (FHT) [4]. Nowadays, the algorithm is studied in detail. In 6(n2 logn) operations, the algorithm allows to calculate accurately the total Hough-image (i.e. an image, each pixel of which contains the sum over the pattern, the parameters of which are usually given by the coordinates of this pixel) over the dyadic patterns (further, we refer to dyadic patterns as dyadic straight lines). The dyadic straight lines well approximate straight lines on the plane, since the approximation error along the ordinate axis is log62 n for straight lines with the slope

k < 1 in the parametrization by an angular coefficient [5]. Moreover, it is proved that the asymptotic complexity of the FHT algorithm cannot be improved [6].

In the FHT algorithm, the sum over a dyadic pattern is composed of two sums over halves of the pattern, which are also dyadic patterns, while the sums over the halves are also composed of the sums over their halves, etc. Therefore, if intermediate results are

0

x

Fig. 1.

rization

The (s, t)-paramete-of a predominantly horizontal pattern with a downward slope (t > 0)

saved during the process of calculating the FHT, then a FHT pyramid is formed, which is largely similar to a beamlet pyramid. Saving intermediate results does not increase asymptotic complexity of the FHT algorithm, therefore, the FHT pyramid can be constructed in 6(n2 logn) operations.

In this paper, we propose an algorithm for fast (in ©(logn) operations) finding of the sum over an arbitrary (given by a pair of points) segment of a dyadic pattern in an image. This algorithm is based on the FHT-pyramid and can be used as a method for calculating the fast beamlet transformation (approximation of the beamlet transformation summing over dyadic patterns). The asymptotic complexity of the fast beamlet transformation is 6(n2 log2 n) operations. The proposed algorithm is also of independent value in problems that require to calculate numerous sums over segments in an image, but do not require such a high-level theory as the beamlet analysis [7]. At the same time, if it is necessary to find only a summation over segments having a certain range of slope angles, then the volume of the pre-calculation can be reduced by performing a partial FHT and calculating the pyramid only for the partial FHT [8].

For three-dimensional images, two generalizations of the Radon transformation are known: the three-dimensional Radon transformation (summations over straight lines in the three-dimensional space) and the John transformation (summations over planes). For both generalizations, the algorithm for fast summation over dyadic patterns is known: the three-dimensional fast Hough transformation is used over straight lines and planes, respectively [9]. In this paper, we propose an algorithm for fast summation over segments of dyadic straight lines in a three-dimensional image and over segments of dyadic planes of a special type.

1. Fast Finding of the Sum Over an Arbitrary Segment

For brevity, we consider only square images of the size n x n pixels, where n = 2p. Following the paper [9], we use the (s, t)-parameterization of straight lines in the image. To this end, first of all, we divide all the straight lines into two classes: predominantly vertical straight lines (i.e. vertical straight lines and straight lines with |k| > 1 in the parameterization by the angular coefficient) and predominantly horizontal straight lines (i.e. straight lines with |k| < 1). Therefore, we refer straight lines with k = 1,

0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ■■■■■■■■■■■■■■■I

0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0

0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ■■■■■■■■■■■■■■■■

■■■■■■■■■■■■■■■I

1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 |2 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

щ 0 Ш 2

J 0

0 0 0 0 0 8 8| 0 0 0 с|8|Й 000 °Ts||s| 00001 ^0 0 0 00 |12 00000221 0 0 0 0 |2 2 2 1 0 0 0 ЙИ 2

4 9 5 3 3

5 5 0 0 0 0 0 0 20000000 00000000 00000000 00000000 00000000

00000000 00000000 00000000 00000000 00000000 00000000 00000001 000000 |1 2 00000221 0 0 0 °l2 2 2 1 0 0 0002 2 2

2 4 8 4 2 2

4 4 0 0 0 0 0 0 20000000 00000000

о X

000000000 ОТЭШ0Е 1 1 0 0 0 0 0 0 ШШШ18 0 2 2 2 1 2

0 ОЭШШИ 0 0 0 0 2 2 2 2 3 2

JH 0 0 0 0 0 0 0 2 2 2 3 2 3 2

0 0 0 0 0 0 0 0 |2 4 4 4 3 2 1 1

0 0 0 0 0 0 0 4 4 2 2 2 1 1

0 0 0 0 0 0 ПН 6 4 2 2 |3 3 3 3

2 4 4 4 0

2 4 4 0 0

4 2 0 0 0

2 0 0 0 0

00000000000

0000000000000001 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 2

f S

Fig. 2. FHT-pyramid for an image of the size 16 x 16

0001 0001 0001 0001 0001 0001 0001 0001 + + + + + + + + + + + + + + + +

° 8 8

3 2 12

00000000

00000000

t

9 17 9

3333

i.e. inclined at an angle 45°, to both classes. Note that the FHT for predominantly horizontal straight lines is also suitable for predominantly vertical ones. Indeed, it is enough to rotate the original image by 90°. Therefore, further we consider only predominantly horizontal straight lines. Such a straight line can be given by the parameters s and t of the coordinates of two points on the vertical boundaries of the image: (0, s) and (n — 1, s +1), where t G [—n +1; n — 1] and s G [—n +1; n — 1], see Fig. 1. For brevity, we consider only the case of t > 0, i.e. predominantly-horizontal patterns inclined downward and the Hough-image for such patterns.

A Hough-image is represented as an image of the size n x n, whose each point (s,t) contains the sum over the pattern with such parameters. In this paper, we are interested not only in the algorithm itself, but also in the intermediate results of the algorithm implementation, which can be represented as (log2 n + 1) images (levels) of the size n x n forming a FHT-pyramid.

1.1. FHT-Pyramid

Taking into account the assumptions and the definitions given above, we describe the FHT-pyramid and the method of its calculation. The zero level of the FHT-pyramid is the original image, and the last level is the Hough-image containing sums over dyadic straight lines of the length n = 2p. Similarly to the description of the beamlet pyramid level, we describe the q-th level of the FHT-pyramid. To this end, divide the original image into vertical stripes [k ■ 2q, (k + 1) ■ 2q — 1] x [0, n — 1], where k denotes the number of the stripe, k G [0, 2p-q — 1]. For each stripe, the q-th level of the FHT-pyramid contains the sums over all possible patterns of the stripe that have the length 2q and the parameters s G [0, n — 1] and t G [0, 2q — 1]. The number of such patterns is equal to n x 2q for each strip. Therefore, the q-th level of the FHT-pyramid requires as much memory as the original image. Fig. 2 shows an example of the algorithm implementation and the formation of all levels of the FHT-pyramid.

1.2. Finding of the Sum Over a Segment of a Known Dyadic Straight Line

Consider a fast way to calculate the sum over any segment in the image using the FHT-pyramid [10].

Suppose that it is necessary to find the sum over the segment B'C of a predominantly horizontal inclined downward dyadic straight line AD, whose parameters (s,t) are known. Let us find the sum as the difference between the sums over the segments AC and AB.

Denote the length of the segment AB by l. Let us find the sum over AB as the sum of the minimum number of already calculated sums saved in the FHT-pyramid, see Fig. 3.

Suppose that the binary expansion of the length of AB has the form l = YJj=o j2j. Then the sum over the segment AB is calculated by the formula

p p Ss,i(l) = 5] [lq ■ Hq, £ j ■ 2j-q)

q=0 j=q+1

where Hs,t(q, k) denotes the sum over the pattern of the length 2q in the k-th stripe of the q-th level of the FHT-pyramid for a straight line with the parameters (s,t).

The internal sum does not require a complete calculation at each step, since the sum is obtained from the previous one in constant time. Therefore, the complexity of the

algorithm is proportional to the number of terms in the external sum, i.e. is equal to ©(logn). Similarly, the sum over the segment AC is calculated. For the case of dyadic straight

lines in a three-dimensional image, note that the described algorithm for finding of the sums is almost identical, except that the search is performed in a four-dimensional FHT-pyramid, which does not affect the complexity of the search due to dyadic organization of the FHT-pyramid.

1.3. Finding of a Dyadic Pattern

3(00112)

10(10102 )

[A Z B B' C □ D

01

00

11

10

+

- +

Dyadic pattern

Level 3 Level 2

Level 1

Original image

Fig. 3. Calculation of the sum over the segment B'C of the straight line AD as the difference between the sums over the segments AC and AB

The paper [9] proves that at least one dyadic pattern passes through any two pixels, but such a pattern can

be not unique. Let us solve the problem on finding of the parameters (s, t) of any one of the patterns passing through the given points (xi,yi) and (x2,y2). Without loss of generality, we assume that x2 > x1. Since, as stated above, we consider only predominantly horizontal patterns inclined downwards, then y2 > y1 and y2 — y1 < x2 — x1.

Suppose that the binary expansion of the parameter t of the desired pattern has the form t = Y^Zo tj2j. Then the pattern can be given by the following equation [5]:

p-i

y = Ds>i(x) = s + ^ tr

r=0

2rx 2 p - 1

(1)

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

where [ ■ ] denotes rounding to the nearest integer. Expression (1) shows that translation of any dyadic pattern can be represented as the sum of subset translations of the basic dyadic patterns of the form

Dr (x) =

2rx

2 p - 1

(2)

Substitute the coordinates of the points (xi,yi) and (x2,y2) into expression (1) for the pattern and subtract the second equation from the first one:

p-i

У2 - yi = Ds,i(x2) - Ds,t(xi) = £tr(Dr(x2) - Dr(xi)).

r=0

Denote Ay = y2 — y1. Let ^X1^2 = Dj(x2) — Dj(x1) be the number of shifts of the i-th basic dyadic pattern in the stripe (x1,x2), then the equation takes the form

Ay

p-i

r=0

X2

Therefore, the problem on finding of a slope is reduced to the following problem: to restore the bits to, t1, ..., tp-1 for the known Ay and ^X1'^, ^X1'^, ..., ^X-'X2.

To this end, apply the "greedy" algorithm. First, consider all tj to be zero. Since ¿f'X2 > ^J1'®2 for i > j, we go through shifts from the large number to less one, that is,

1

go from the larger level to zero one. If Ay > ¿j11'12, then consider t to be equal to 1 and reduce Ay by ¿j11'12. Repeat the procedure until Ay is not equal to zero (we prove that Ay = 0 at some moment, i.e. that the algorithm is correct, below).

Use the value of the parameter t to find the parameter s of the dyadic pattern as

p-i

s = D(xi,t) — trDr(xi).

r=0

Since #{¿1 '12} = p = log2 n, then the complexity of the algorithm is ©(log n) operations. 1.3.1. Correctness of the Algorithm

Theorem 1. Dj(x) - 1 < EA(X) ^ DAX) Vx £ 0,n- 1 and Vj G 0 ,p- 1.

j<j

Proof. For j G 0,p — 1, consider the sets Jj formed by the coordinates of the shift pixels of the basis dyadic pattern with slopes 2j, i.e. those for which the ordinate is greater than the ordinate of the previous one: Dj (x) = Dj (x — 1) + 1.

Since |Jj+i| = 2|Jj|, then, similar to a binary tree, where the number of leaves is

greater by unity than the number of other nodes, we obtain that Jj = E Jj + 1. According

j<j

to formula (2), the distance between two adjacent shift pixels is the same for the entire width of an image, and Jj if Jj = 0, Vi, j G [0,p — 1], i = j.

From the properties given above, it follows that the shift pixels of the set Jj alternate with the shift pixels of all other sets Jj (for i < j), and the first shift pixel belongs to the set Jj.

Therefore, Vx G [0,n — 1], the number of boundary pixels of Dj(x) is either equal to

or greater by unity than Y Dj (x).

j<j □

Corollary 1.

^ ¿X1'X2 — 1 < jX2 < ^ ¿X1'X2 + 1 (3)

j<j j<j

Vj G 0,p — 1 and Vxi, x2 G [0, n — 1] such that x\ > x2.

Proof. Rewrite Theorem 1 as E Dj (x) < Dj (x) < E Dj (x) + 1. Apply this pair of

j<j j<j

inequalities to the points xi and x2 and pairwise add the inequalities taking into account the signs.

Theorem 2. If the numbers ¿X1'12, ¿i1'12,..., ¿X1'12 satisfy condition (3), then the sums

of subset translations of their every possible subsets cover the range of values from 0 to k

EC'12. j=0

Proof. Let us prove by induction. First, we show that the statement is true for i = 0. Formula (2) implies that 5q1,X2 = [^zj] — [^j] > 0, since x2 > X\ by condition, i.e.

0 < ¿OX1'12 < 1.

Second, suppose that the statement is true for i = k — 1, that is, we can obtain all k-1

sums from 0 to ^X1^2 for the given numbers ^X^2, #!1'X2, •••,^X-'X2.

i=0

Finally, show that the statement is true for i = k. Inequality (3) is satisfied for S\

k .

k-1

In other words, the difference between ii1'12 and ^X1'12 is no more than 1. Therefore,

i=0

we can add ^Ц1'12 to every possible sum of the numbers i^1'12, i^1'12, •••, ^X-'i2 to obtain all

sums from 0 to if1'12. This completes the proof.

i=0 i

p-i x x

Therefore, if Ay < Y1 ^X1'12, then there exists a dyadic pattern passing through two

i=0 i

given pixels.

Theorem 3. The proposed algorithm to find a dyadic pattern always converges to a solution.

Proof. Since we go through ^X1'X2 from k = p — 1 to k = 0, then inequality (3) is satisfied for all CX2, ^X1'X2-CX2 at each stage, therefore, Theorem 2 is true for ^X1'X2, ^X1'X2•••^X-'X2. Hence, the problem has a solution at each stage. Therefore, we always obtain a solution to the problem.

The proposed algorithm and its properties are directly generalized to the case of straight lines in large dimensions. To this end, it is enough to solve the problem for projections of a straight line of the length n on the coordinate planes sequentially.

2. Fast Finding of the Sum Over a Segment of a Plane

Following the paper [9], we introduce the typification of planes in a three-dimensional image. A plane is said to be predominantly perpendicular to a coordinate axis, if an angle between the normal to the plane and this coordinate axis is smaller than each of angles between the normal and other coordinate axis. We consider only the planes that are predominantly perpendicular to the coordinate axis OX. Such planes are divided into four types depending on the slopes, see Fig. 4. Without loss of generality, we assume that the considered planes are of the type I.

Let EFGH be a dyadic plane in which it is necessary to find the sum over a segment. Similarly to the previous section, we consider the parameters (s,ti,t2) of the dyadic plane EFGH to be already found. As the segments, we consider the parallelograms, whose sides are parallel to the faces of the image. For example, see the parallelogram ABCD in Fig. 5.

Fig. 4. Typification of planes in the space given in the paper [9]

k

Let us briefly describe the procedure of pre-calculation. According to the paper [9], the algorithm of a three-dimensional FHT for planes is as follows. First, apply a two-dimensional FHT to each horizontal layer of a three-dimensional image (such an operation is called the horizontal FHT). Then, in the resulting image, apply a FHT to each vertical

F.

G

E K

G.

B C

A D

—A

H

M

E

N E

tC

+

MrrB

L

E

t

E K

Fig. 5. The dyadic plane EFGH containing the segment ABCD, and their projections to the coordinate plane OYX

plane (such an operation is called the vertical FHT). In order to construct a three-dimensional FHT pyramid, we apply the vertical FHT (and save all intermediate values) to the two-dimensional FHT pyramids constructed in the process of applying the horizontal FHT to each horizontal layer. In other words, the three-dimensional FHT pyramid contains the sums over all rectangles of the form 21 x 2m from the dyadic division of the Donoho-type plane.

Since the vertical FHT is applied to n log2 (n) planes, and the complexity of the two-dimensional FHT is ©(n2logn), then the described calculation of the three-dimensional FHT pyramid has the complexity ©(n3 log2 n).

The sum over the segment ABCD can be calculated by the inclusion-exclusion formula

SABCD = SEMCL — SEMBK — SENDL + •

Note that the point E is one of the angles for each segment. Further, we refer to such segments as the angle segments.

Consider the angle segment EMCL. It is necessary to divide EMCL into the segments presented in the FHT pyramid. Consider binary expansions of the width and height of a segment. Similarly to the one-dimensional case, divide the segment horizontally into log2 n vertical stripes and vertically into at most log2 n horizontal stripes.

Intersection of these stripes contains no more than log2(n) segments presented in the three-dimensional FHT pyramid, see Fig. 6. Therefore, the complexity of calculating the sum over an arbitrary segment is ©(log2 n) operations.

0 1 2 3

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

1 1 1 1 1

3

2

1

0

Fig. 6. Algorithm to find a segment of a plane taking into account the previously calculated three-dimensional FHT pyramid

Discussions

The algorithms similar to FHT have an important practical property: in order to calculate such an algorithm, it is enough to use the addition operation only. This fact is essential for the calculation complexity. Algorithms having good computational characteristics are especially in demand to create modern neural network detectors. For example, the paper [11] proposes an effective method to find the vanishing point of an image using a convolutional neural network containing FHT layers capable to detect straight lines in the image. In the paper [11], the use of FHT allows to reduce significantly the size of the neural network under the same quality of detection. By analogy, we can assume that, in the problems on detection of segments of the image, effective convolutional neural networks include layers that calculate the FHT pyramid.

Also, the developed solutions can be useful to create computer vision systems based on the algorithms of segment detection, for example, in combination with the methods proposed by D.L. Donoho and coauthors.

On the other hand, a number of nontrivial and interesting mathematical problems arise when the FHT is generalized to the cases of higher dimensions. In particular, the existence of a dyadic plane passing through any three given voxels is not proved yet, to say nothing of generalizations of these statements to the cases of higher dimensions.

The study and development of the algorithms similar to FHT has good potential both for use in various image-based systems of navigation and localization and for new mathematical problems.

Conclusion

In this paper, we present an algorithm for the fast search for segments of straight lines and segments of planes in the two-dimensional and three-dimensional cases using the corresponding fast Hough transformation as a pre-calculation. Also, we propose a method to find the parameters of a dyadic straight line passing through arbitrary two pixels of an image. In order to continue the development of this work, we plan to study the issue of proving the theorem that at least one dyadic plane passes through any three voxels, as well as to investigate the possibility to calculate fast the sum over an arbitrary triangular segment in a three-dimensional image.

Acknowledgments. The team of authors is grateful to A.P. Terekhin for his help in preparing wonderful illustrations for this paper, as well as to A.V. Savchik for valuable advices on how to simplify the proof of Theorems.

References

1. Donoho D.L., Xiaoming Huo. Beamlets and Multiscale Image Analysis. Berlin, Springer, 2002.

2. Arias-Castro E., Donoho D.L., Xiaoming Huo. Near-Optimal Detection of Geometric Objects by Fast Multiscale Methods. IEEE Transactions on Information Theory, 2005, vol. 51, no. 7, pp. 2402-2425.

3. Brady M.L., Whanki Yong. Fast Parallel Discrete Approximation Algorithms for the Radon Transform. Proceedings of the Fourth Annual ACM Symposium on Parallel Algorithms and Architectures, 1992, pp. 91-99

4. Nikolaev D.P., Karpenko S.M., Nikolaev T.P., Nikolaev P.P. Hough Transform: Underestimated Tool in the Computer Vision Field. Proceedings of the 22th European Conference on Modelling and Simulation, 3-6 June, Nicosia, 2008, vol. 238, pp. 246. D01:10.7148/2008-0238

5. Ershov E.I., Karpenko S.M. Transform and Approximation Properties of Dyadic Patterns. Available at: https://arxiv.1712.05615.

6. Khanipov T.M. Computational Complexity Lower Bounds of Certain Discrete Radon Transform Approximations. 2018. Available at: https://arxiv.1801.01054.

7. Skoryukina N., Shemyakina J., Arlazarov V.L., Faradzhev I. Document Localization Algorithms Based on Feature Points and Straight Lines. The Tenth International Conference on Machine Vision, 13-16 April, Vienna, vol. 10696, pp. 106961H.

8. Aliev M., Ershov E.I., Nikolaev D.P. On the Use of FHT, Its Modification for Practical Applications and the Structure of Hough Image. Eleventh International Conference on Machine Vision, 16-18 November, Amsterdam, 2019, vol. 11041, pp. 1104119. DOI: 10.1117/12.2522803

9. Ershov E.I., Terekhin A.P., Nikolaev D.P. Generalization of the Fast Hough Transform for Three-Dimensional Images. Journal of Communications Technology and Electronics, 2018, vol. 63, no. 6, pp. 626-636

10. Soshin K.V., Gladilin S.A., Ershov E.I. [About Fast Search Sums Through Segments on the Image With Pre-Calculated Hough-Pyramid]. 43 Mezhdistsiplinarnaya shkola-konferentsiya IPPI RAN "Informatsionnye tekhnologii i sistemy" [43-rd Interdisciplinary School-Conference IITP RAS "Information Technolgies and Systems"]. Perm, 2019. (in Russian)

11. Sheshkus A., Ingacheva A., Nikolaev D. Vanishing Points Detection Using Combination of Fast Hough Transform and Deep Learning. International Society for Optics and Photonics, 2018, vol. 10696, pp. 106960.

12. Arlazarov V.L., Dinits E.A., Kronrod M.A., Faradzhev I.A. [On Economical Construction of The Transitive Closure of a Directed Graph]. Doklady Academii Nauk, 1970, vol. 194, no. 3, pp. 487-488. (in Russian)

Received November 25, 2019

УДК 004.021 DOI: 10.14529/mmp200110

УСКОРЕНИЕ СУММАЦИИ ПО ОТРЕЗКАМ С ИСПОЛЬЗОВАНИЕМ ПИРАМИДЫ БЫСТРОГО ПРЕОБРАЗОВАНИЯ ХАФА

К.В. Сошин1'2, Д.П. Николаев2, С.А. Гладилин2, Е.И. Ершов2

1Московский физико-технический институт (НИУ), г. Долгопрудный, Российская Федерация

2Институт проблем передачи информации им. А.А. Харкевича, г. Москва, Российская Федерация

В работе предложен алгоритм быстрого приближенного вычисления на изображении сумм по произвольным отрезкам, задаваемым парой пикселей. Используя результаты промежуточных вычислений быстрого преобразования Хафа, предложенный

алгоритм позволяет рассчитать сумму по произвольному отрезку линии с логарифмической сложностью, зависящей от линейного размера исходного изображения. Пред-подсчет реализован как модификация алгоритма Брейди быстрой суммации по диади-ческим аппроксимациям прямых. При таком подходе ключевым элементом алгоритма является поиск диадической прямой, проходящей через два данных пикселя. В работе предложен алгоритм решения этой задачи, не ухудшающий общую асимптотику, для него доказана корректность. Также в работе описывается обобщение этого подхода на трехмерный случай для отрезков и для сегментов плоскостей.

Ключевые слова: поиск отрезков; быстрое преобразование Хафа; дискретное преобразование Радона; алгоритм Брейди; быстрое дискретное преобразование Радона; диадический паттерн; бимлет-пирамида.

Литература

1. Donoho, D.L. Beamlets and Multiscale Image Analysis / D.L. Donoho, Xiaoming Huo. -Berlin: Springer, 2002.

2. Arias-Castro, E. Near-Optimal Detection of Geometric Objects by Fast Multiscale Methods / E. Arias-Castro, D.L. Donoho, Xiaoming Huo // IEEE Transactions on Information Theory. - 2005. - Т. 51, № 7. - С. 2402-2425.

3. Brady, M.L. Fast Parallel Discrete Approximation Algorithms for the Radon Transform / M.L. Brady, Whanki Yong // Proceedings of the Fourth Annual ACM Symposium on Parallel Algorithms and Architectures. - 1992 - C. 91-99.

4. Nikolaev, D.P. Hough Transformation: Underestimated Tool in the Computer Vision Field / D.P. Nikolaev, S.M. Karpenko, I.P. Nikolaev, P.P. Nikolaev // Proceedings of the 22th European Conference on Modelling and Simulation, 3-6 June. - Nicosia, 2008. - T. 238 -C. 246

5. Ershov, E.I Hough Transform and Approximation Properties of Dyadic Patterns / E.I. Ershov, S.M. Karpenko. - 2017. - URL: https://arxiv.1712.05615

6. Khanipov, T.M. Computational Complexity Lower Bounds of Certain Discrete Radon Transform Approximations / T.M. Khanipov. - 2018. - URL: https://arxiv.801.01054

7. Skoryukina, N. Document Localization Algorithms Based on Feature Points and Straight Lines / N. Skoryukina, J. Shemyakina, V.L. Arlazarov, I. Faradzhev // The Tenth International Conference on Machine Vision. - 2018. - V. 10696. - P. 106961H.

8. Aliev, M. On the Use of FHT, Its Modification for Practical Applications and the Structure of Hough Image / M. Aliev, E.I. Ershov, D.P. Nikolaev // International Society for Optics and Photonics. - 2019. - V. 11041. - P. 1104119.

9. Ершов, Е.И. Обобщение быстрого преобразования Хафа для трехмерных изображений / Е.И. Ершов, А.П. Терехин, Д.П. Николаев // Информационные процессы. - 2017. -Т. 17, № 4. - С. 294-308.

10. Сошин, К.В. О быстром поиске сумм по сегментам за счет предподсчета Хаф-пирамид / К.В. Сошин, С.А. Гладилин, Е.И. Ершов //43 Междисциплинарная школа-конференция ИППИ РАН «Информационные технологии и системы:». - Пермь, 2019.

11. Sheshkus, A. Vanishing Points Detection Using Combination of Fast Hough Transform and Deep Learning / A. Sheshkus, A. Ingacheva, D. Nikolaev // International Society for Optics and Photonics. - 2018. - V. 10696. - C. 106960.

12. Арлазаров, В.Л. Об экономном построении транзитивного замыкания ориентированного графа / В.Л. Арлазаров, Е.А. Диниц, М.А. Кронрод, И.А. Фараджев // Доклады Академии наук СССР. - 1970. - Т. 194, № 3. - C. 487-488.

Константин Витальевич Сошин, студент бакалавриата, кафедра проблем передачи информации и анализа данных, Московский физико-технический институт (НИУ) (г. Долгопрудный, Российская Федерация); стажер-исследователь, Институт проблем передачи информации им. А.А. Харкевича (г. Москва, Российская Федерация), soshin.kv@phystech.edu.

Дмитрий Петрович Николаев, кандидат физико-математических наук, заместитель директора по научной работе, Институт проблем передачи информации им. А.А. Харкевича (г. Москва, Российская Федерация), dimonstr@iitp.ru.

Сергей Александрович Гладилин, кандидат физико-математических наук, старший научный сотрудник, Институт проблем передачи информации им. А.А. Харкевича (г. Москва, Российская Федерация), gladilin@iitp.ru.

Егор Иванович Ершов, кандидат физико-математических наук, старший научный сотрудник, Институт проблем передачи информации им. А.А. Харкевича (г. Москва, Российская Федерация), ershov@iitp.ru.

Поступила в редакцию 25 ноября 2019 г.

i Надоели баннеры? Вы всегда можете отключить рекламу.