RU2535184C2

RU2535184C2 - Method and apparatus for detecting local features on image

Info

Publication number: RU2535184C2
Application number: RU2013101659/08A
Authority: RU
Inventors: Владимир Иванович Марчук; Вячеслав Владимирович Воронин; Татьяна Владимировна Морозова; Марина Михайловна Письменскова
Priority date: 2013-01-11
Filing date: 2013-01-11
Publication date: 2014-12-10
Also published as: RU2013101659A

Abstract

FIELD: physics, computer engineering.

SUBSTANCE: invention relates to means of processing digital images. In the method, an image in the RGB colour space is converted to a grayscale; processed with Laws energy characteristics by folding the matrix of the source image with nuclei of the type of detecting an image in the form of ripples, detection of spots, detection of waves and detection of edges; for each of the obtained image, spots, corners and line edges are detected by a SURF method to find a local point using Hessian matrix.

EFFECT: detection of local features on an image.

2 cl, 2 dwg

Description

Изобретение относится к способам обработки цифровых изображений и может быть использовано в системах регистрации и наблюдения, мультимедийных приложениях работающих с визуальными данными.The invention relates to methods for processing digital images and can be used in registration and surveillance systems, multimedia applications working with visual data.

Упрощенная математическая модель изображения представляет собой модель в цветовом пространстве RGB в виде массива S_i,j,k, где $i = \bar{1, I}$

,

j = \bar{1, J}

- столбцы и строки изображения,

k = \bar{1.3}

- цветовой канал. При реализации метода поиска локальных особенностей используется изображение в градациях серого, т.е. каждая точка представлена уровнем яркости в диапазоне от 0 (черный) до 255 (белый), с промежуточными значениями, представляющими различные уровни серого. Соответственно такое изображение обозначим, как S_i,j.A simplified mathematical image model is a model in the RGB color space in the form of an array S _{i, j, k} , where

i = \bar{one, I}

,

j = \bar{one, J}

- columns and rows of the image,

k = \bar{1.3}

- color channel. When implementing the method of searching for local features, an image in grayscale is used, i.e. each dot is represented by a brightness level ranging from 0 (black) to 255 (white), with intermediate values representing different gray levels. Accordingly, we denote such an image as S _{i, j} .

Основная решаемая задача - детектирование локальных особенностей на изображении с определенными текстурными свойствами.The main problem to be solved is the detection of local features in the image with certain texture properties.

Несмотря на то что изображение представляет простую структуру - матрицу двумерных чисел в ней содержится больше количество информации о наблюдаемой сцене. Извлечь структурированную информацию из этой сцены довольно сложная задача. Когда речь идет о последовательности изображений, то задача становится еще более сложной, так как появляются пространственно-временные связи между кадрами. Таким образом, требуются устройства и методы, которые позволят извлекать и анализировать заложенную в видеопоследовательность информацию. Одним из таких подходов является поиск, отслеживание и сопоставление точечных особенностей на последовательности кадров. Это один из простых способов извлечь информацию о динамике сцены. Несколько точек, отслеживаемых в видеопоследовательности, могут давать огромное количество информации.Despite the fact that the image represents a simple structure - the matrix of two-dimensional numbers contains more information about the observed scene. Retrieving structured information from this scene is a daunting task. When it comes to the sequence of images, the task becomes even more complicated, as spatio-temporal relationships between frames appear. Thus, devices and methods are required that will extract and analyze the information embedded in the video sequence. One such approach is to search, track, and map point features on a frame sequence. This is one of the easiest ways to extract scene dynamics information. Several points tracked in a video sequence can provide a wealth of information.

Упрощенно способы детектирования локальных особенностей на изображении можно разделить на следующие группы:Simplified methods for detecting local features in the image can be divided into the following groups:

1) Способы обнаружения углов.1) Ways to detect angles.

2) Способы обнаружения точек.2) Methods for detecting points.

3) Способы обнаружения пятен.3) Methods for spot detection.

4) Способы обнаружения краев.4) Methods for detecting edges.

Анализ существующих способов детектирования показывает, что данные способы работают только с определенным типом локальных особенностей, что является основным их недостатком. Использование способов обнаружения углов на основе метода Моравека приводит к тому, что если угол не направлен в сторону соседней, то он не будет обнаружен как точечная особенность. Детектор Харриса не инвариантен к изменению масштаба изображения и алгоритм поиска углов Ши-Томаси очень чувствителен к шуму и к выбору масштабного уровня. Применение способов на основе обнаружения точек показывает, что метод SURF (Speeded Up Robust Features) не работает с объектами простой формы и без выраженных границ, а метод SIFT (Scale Invariant Feature Transform) не работает с зеркальными поверхностям и чувствителен к изменению освещения. Для использования способов обнаружения краев на основе оператора Кирша требуется больших вычислительных затрат, что ограничивает его использование в масштабе реального времени. Способы обнаружения пятен (областей) на основе детектора областей MSER (Maximally Stable Extremal Regions) приводит к тому, что он не инвариантен к масштабу при слабой контрастности, но нормализует все 6 параметров аффинных преобразований. Детектор областей FAST (Features accelerated segment testing) определяет локальные особенности достаточно быстро.An analysis of existing detection methods shows that these methods work only with a certain type of local features, which is their main drawback. Using methods for detecting angles based on the Moravek method leads to the fact that if the angle is not directed towards the neighboring one, then it will not be detected as a point feature. The Harris detector is not invariant to zooming and the Shi-Tomashi angle search algorithm is very sensitive to noise and to choosing a scale level. The use of methods based on point detection shows that the SURF (Speeded Up Robust Features) method does not work with objects of simple shape and without pronounced boundaries, and the SIFT (Scale Invariant Feature Transform) method does not work with mirror surfaces and is sensitive to changes in lighting. Using edge detection methods based on the Kirsch operator requires large computational costs, which limits its use in real time. Methods for detecting spots (areas) based on the MSER (Maximally Stable Extremal Regions) region detector leads to the fact that it is not scale invariant with low contrast, but normalizes all 6 parameters of affine transformations. The FAST area detector (Features accelerated segment testing) detects local features quickly enough.

Известен способ и устройство для обнаружения объекта на изображении [Патент US №6711293, МПК G06K 9/68]. Изобретение относится к способам распознавания объектов в системах машинного зрения.A known method and device for detecting an object in the image [US Patent No. 6711293, IPC G06K 9/68]. The invention relates to methods for recognizing objects in machine vision systems.

Основная техническая задача состоит в распознавании объектов на изображении, с использованием инвариантной функции масштаба. На первом этапе выполняется свертка изображения с функцией Гаусса, далее используется пирамида Гаусса для построения разностного изображения. На втором этапе находят локальные экстремумы путем сравнения каждого пикселя изображения с несколькими соседними данного масштаба. На следующем этапе выделяют область вокруг точек экстремума. На четвертом вычисляется направление градиентов некоторой окрестности и на пятом этапе для каждой точки экстремума вычисляется локальный дескриптор.The main technical task is to recognize objects in the image using an invariant scale function. At the first stage, the image is convolved with the Gaussian function, then the Gauss pyramid is used to construct the difference image. At the second stage, local extrema are found by comparing each pixel of the image with several neighboring pixels of a given scale. At the next stage, an area around the points of extremum is distinguished. At the fourth, the direction of the gradients of a certain neighborhood is calculated, and at the fifth stage, for each extremum point, a local descriptor is calculated.

Признаки способа-аналога, совпадающие с признаками заявляемого технического решения, следующие:The signs of the method-analogue, coinciding with the signs of the proposed technical solution, are as follows:

- инвариантность к изменению масштаба;- invariance to zooming;

- нахождение локальных экстремумов.- finding local extremes.

Недостатками известного способа и устройства его реализующего являются:The disadvantages of the known method and device that implements it are:

- низкая точность определения локальных дескрипторов.- low accuracy of determining local descriptors.

Причины, препятствующие достижению требуемого технического результата, заключаются в следующем:The reasons that impede the achievement of the required technical result are as follows:

- так как функция Гаусса обладает ограниченным набором инвариантных функций, характеризующих особенности изображения, описываемые векторами-признаками, и тем самым данные вектора являются менее информативными.- since the Gaussian function has a limited set of invariant functions characterizing the image features described by feature vectors, and thus the vector data are less informative.

Известен способ распознавания объектов [Патент №2438174 С1, МПК G06K 9/68]. Изобретение относится к способам распознавания объектов в системах машинного зрения, телевизионных системах наблюдения, информационно-управляющих системах робототехнических комплексов.A known method of recognition of objects [Patent No. 2438174 C1, IPC G06K 9/68]. The invention relates to methods for recognizing objects in machine vision systems, television surveillance systems, information and control systems of robotic complexes.

Основная техническая задача, решаемая заявляемым изобретением, состоит в создании способа, позволяющего повысить точность распознавания за счет повышения стабильности работы детекторов ключевых областей на изображении и увеличения количества инвариантных характеристик данных детекторов. Способ осуществляется следующим образом: входное изображение сворачивается с заданной функцией Грина:The main technical problem solved by the claimed invention is to create a method that can improve recognition accuracy by improving the stability of the detectors of key areas in the image and increasing the number of invariant characteristics of these detectors. The method is as follows: the input image is minimized with the given Green function:

$U (y, τ | y_{0}, τ_{0}) = \frac{\sqrt{λ}}{\sqrt{D π (1 - e^{- 2 λ (τ - τ_{0})})}} * \exp [- \frac{\sqrt{λ}}{D π (1 - e^{- 2 λ (τ - τ_{0})})} {(y - y_{0} e^{- λ (τ - τ_{0})})}^{2}]$

U (y, τ | | | y_{0}, τ_{0}) = \frac{\sqrt{λ}}{\sqrt{D π (one - e^{- 2 λ (τ - τ_{0})})}} * \exp [- \frac{\sqrt{λ}}{D π (one - e^{- 2 λ (τ - τ_{0})})} {(y - y_{0} e^{- λ (τ - τ_{0})})}^{2}]

для эволюционного оператора процесса Орнштейна-Уленбека при различных значениях параметров τ, λ.for the evolutionary operator of the Ornstein-Uhlenbeck process for various values of the parameters τ, λ.

$\partial_{τ} U (y, τ) = \partial_{y} (λ y U (y, τ)) + \frac{D}{2} \partial_{y y} U (y, τ)$

,

\partial_{τ} U (y, τ) = \partial_{y} (λ y U (y, τ)) + \frac{D}{2} \partial_{y y} U (y, τ)

,

где τ, λ - параметры, используемые для определения сверток на различных масштабах;where τ, λ are the parameters used to determine convolutions at various scales;

D - константа, которая подбирается опытным путем для различных категорий изображений;D is a constant that is selected empirically for various categories of images;

y=(a, b) - пространственная временная функция, суть координаты (a,b) пикселя на изображении.y = (a, b) is the spatial time function, the essence of the coordinate (a, b) of the pixel in the image.

Полученные свертки вычитают друг из друга для получения конечно-разностной аппроксимации первой производной свертки входного изображения с фильтром. При поиске локального экстремума данной свертки приравнивают к нулю соответствующие первые производные. Находят все локальные экстремумы и проводят адаптивную пороговую фильтрацию для отсечения незначительных особенностей. Выделенные таким способом точки служат центрами окрестностей, для которых строят произвольные дескрипторы.The resulting convolutions are subtracted from each other to obtain a finite-difference approximation of the first derivative of the convolution of the input image with the filter. When searching for a local extremum of a given convolution, the corresponding first derivatives are equated to zero. All local extremes are found and adaptive threshold filtering is performed to cut off minor features. Points distinguished in this way serve as centers of neighborhoods for which arbitrary descriptors are constructed.

- изменяя набор симметрии выбором соответствующей функции Грина, получать для одного и того же изображения векторы-признаки, отражающие различные свойства изображения, повышая тем самым информативность представления изображения.- changing the symmetry set by choosing the corresponding Green's function, to obtain feature vectors for the same image that reflect various properties of the image, thereby increasing the information content of the image.

- детектирование особенностей в виде локальных областей экстремума.- detection of features in the form of local areas of extremum.

- использование способа свертки с функцией Грина не позволяет детектировать различные типы текстур.- the use of the convolution method with the Green function does not allow to detect various types of textures.

Известен способ выделения контуров движущихся объектов [Патент №2466456 С2, МПК G06K 9/50]. Изобретение относится к области распознания образов и может быть использовано в системах технического зрения при решении задач предварительной обработки изображения.A known method for highlighting the contours of moving objects [Patent No. 2466456 C2, IPC G06K 9/50]. The invention relates to the field of pattern recognition and can be used in vision systems for solving problems of preliminary image processing.

Основная техническая задача, решаемая заявляемым изобретением, состоит в повышении точности определения областей движущихся объектов и повышения скорости выделения контуров изображения.The main technical problem solved by the claimed invention is to increase the accuracy of determining the areas of moving objects and increase the speed of selection of image contours.

Структурная схема устройства включает в себя:The block diagram of the device includes:

- выделение контуров движущихся объектов, включая обнаружение движущихся пикселей по принципу межкадрового вычитания;- selection of the contours of moving objects, including the detection of moving pixels according to the principle of inter-frame subtraction;

- определение направления движения обнаруженных пикселей с учетом направления движения смежных пикселей;- determining the direction of motion of the detected pixels, taking into account the direction of motion of adjacent pixels;

- формирование движущихся объектов путем объединения смежных пикселей с одним направлением и наличия пикселей в восьмисвязной окрестности;- the formation of moving objects by combining adjacent pixels with one direction and the presence of pixels in an eight-connected neighborhood;

- пространственное дифференцирование обнаруженных объектов оператором Собела:- spatial differentiation of detected objects by the Sobel operator:

$\begin{array}{l} G_{r} = 〈 G (x, y), v (x, y) 〉 = f_{e d} (S (k), I (k)) = \\ 〈 \sqrt{{(\frac{δ I (x, y)}{\partial x})}^{2}} + {(\frac{δ I (x, y)}{\partial y})}^{2}, a r c t g (α) 〉 \end{array}$

\begin{array}{l} G_{r} = 〈 G (x, y), v (x, y) 〉 = f_{e d} (S (k), I (k)) = \\ 〈 \sqrt{{(\frac{δ I (x, y)}{\partial x})}^{2}} + {(\frac{δ I (x, y)}{\partial y})}^{2}, a r c t g (α) 〉 \end{array}

$α = (\frac{(\frac{\partial I (x, y)}{\partial y})}{(\frac{\partial I (x, y)}{\partial x})}), \forall (x, y) \in S (k)$

α = (\frac{(\frac{\partial I (x, y)}{\partial y})}{(\frac{\partial I (x, y)}{\partial x})}), \forall (x, y) \in S (k)

- скелетизация контурных линий на градиентном изображении методом подавления точек немаксимальной яркости, пороговая обработка скелетного градиентного изображения на основе максимума и минимума энтропии.- skeletonization of contour lines in a gradient image by suppressing non-maximum brightness points, threshold processing of a skeletal gradient image based on the maximum and minimum of entropy.

- выделение контуров объектов на изображении.- selection of the contours of objects in the image.

- низкая точность определения локальных дескрипторов. - low accuracy of determining local descriptors.

- пространственное дифференцирование оператором Собела при наличии текстурированных объектов приводит к значительным ложным детектированиям.- spatial differentiation by the Sobel operator in the presence of textured objects leads to significant false detections.

Известен способ детектирования особых точек и устройство, реализующее его [Патент US №2007/0071289 А1, МПК G06K 9/00]. Изобретение относится к детектированию точечных особенностей и способу детектирования локальных особенностей лица.A known method for detecting singular points and a device that implements it [US Patent No. 2007/0071289 A1, IPC G06K 9/00]. The invention relates to the detection of point features and a method for detecting local features of a face.

Основная техническая задача, решаемая заявляемым изобретением, состоит в создании устройства и способа корректного детектирования локальных особенностей на изображении лица одной точечной нормализацией и многоточечной нормализацией распознавания образов.The main technical problem solved by the claimed invention is to create a device and method for correctly detecting local features in a face image with one point normalization and multi-point normalization of pattern recognition.

Структурная схема устройства для обнаружения локальных особенностей включает в себя:The structural diagram of a device for detecting local features includes:

- запоминающее устройство, настроенное на хранение первого шаблона для первой локальной точки объекта, второго шаблона для второй локальной точки, а третий шаблон для комбинации первой и второй локальной точки;- a storage device configured to store the first pattern for the first local point of the object, the second pattern for the second local point, and the third pattern for the combination of the first and second local points;

- устройство ввода изображений;- image input device;

- блок кандидат;- block candidate;

- первый блок распознавания образов, извлекает множество третьих кандидатов из множества первых кандидатов на основе сходства между первым кандидатом и первым шаблоном, а также извлекает множество четвертого кандидата из множества вторых кандидатов на основе сходства между каждым вторым кандидатом и вторым шаблоном;- the first pattern recognition unit, extracts a plurality of third candidates from the plurality of first candidates based on similarities between the first candidate and the first template, and also extracts a plurality of fourth candidates from the plurality of second candidates based on similarities between each second candidate and the second template;

- второй блок распознавания образов, создает множество первых комбинаций каждого третьего кандидата, а каждый четвертый кандидат извлекается второй комбинацией из множества первых сочетаний, основанных на третьем сходстве между первой комбинацией и третьим шаблоном.- the second pattern recognition unit, creates many of the first combinations of every third candidate, and every fourth candidate is extracted by the second combination of the many first combinations based on the third similarity between the first combination and the third template.

- изобретение предназначено только для детектирования точечных особенностей локальных особенностей лица.- the invention is intended only for the detection of point features of local features of the face.

Известен способ обнаружения углов на основе метода Моравека [Moravec Н (1980). "Obstacle Avoidance and Navigation in the Real World by a Seeing Robot Rover"]. Данный способ проверяет каждый пиксель в изображении, чтобы определить является ли тот углом, рассматривая участки в области пикселя. Сходство определяется путем принятия суммы квадратов разностей между двумя участками. Меньшее число указывает на большее сходство.A known method for detecting angles based on the Moravec method [Moravec N (1980). "Obstacle Avoidance and Navigation in the Real World by a Seeing Robot Rover"]. This method checks every pixel in the image to determine if that pixel is an angle by looking at areas in the pixel area. Similarity is determined by taking the sum of the squared differences between the two sections. A smaller number indicates a greater similarity.

Способ осуществляется следующим образом:The method is as follows:

1. Используется тот факт, что однородные и реберные участки изображения самоподобны.1. The fact that the homogeneous and ribbed portions of the image is self-similar is used.

2. В каждом пикселе считается функция самоподобности окрестностей с различным радиусом у этого пиксела.2. In each pixel, the self-similarity function of neighborhoods with a different radius for this pixel is considered.

3. Строится карта самоподобности.3. A self-similarity map is built.

4. Выбираются локальные максимумы рассогласования различных окрестностей пиксела.4. Local mismatch maxima of various pixel neighborhoods are selected.

5. Если величина локального максимума больше порога, то точка классифицируется как угловая:5. If the local maximum is greater than the threshold, then the point is classified as angular:

$E_{x, y} = \sum_{u, v} w_{u, v} {[I_{x + u, y + u} - I_{u, v}]}^{2}$

,

E_{x, y} = \sum_{u, v} w_{u, v} {[I_{x + u, y + u} - I_{u, v}]}^{2}

,

где I - обозначается интенсивность изображения; w - окно, которое определяет область изображения с коэффициентом единица.where I is the image intensity; w is a window that defines an image area with a coefficient of unity.

- детектирование локальных особенностей в виде углов.- detection of local features in the form of angles.

Недостатками известного способа являются:The disadvantages of this method are:

- если угол не направлен в сторону соседней, то он не будет обнаружен, как точечная особенность.- if the angle is not directed towards the adjacent one, then it will not be detected as a point feature.

Известен способ обнаружения углов с помощью преобразования Хафа [Шапиро Л., Дж. Стокман. Компьютерное зрение; Пер с англ. - М.: БИНОМ. Лаборатория знаний, 2006. - С.413]. Данный способ преобразования Хафа использует аккумуляторный массив, размерность которого соответствует количеству неизвестных параметров в уравнении семейства искомых кривых. Области углов на изображении можно обнаружить посредством поиска пар выделенных краевых сегментов E1 и Е2, удовлетворяющих перечисленным ниже условиям:A known method of detecting angles using the Hough transform [Shapiro L., J. Stockman. Computer vision; Per from English. - M .: BINOM. Laboratory of Knowledge, 2006. - S. 413]. This Hough transform method uses a battery array whose dimension corresponds to the number of unknown parameters in the equation of the family of desired curves. Angle regions in the image can be detected by searching for pairs of selected edge segments E1 and E2 that satisfy the conditions listed below:

- аппроксимирующие прямые для множеств краевых точек E1 и Е2 пересекаются в точке [u,v], координаты которой определены в действительной системе координат изображения;- approximating straight lines for the sets of boundary points E1 and E2 intersect at the point [u, v], the coordinates of which are defined in the actual coordinate system of the image;

- точка [u,v] лежит близко от крайних точек множеств E1 и Е2;- the point [u, v] lies close to the extreme points of the sets E1 and E2;

- направления градиентов E1 и Е2 симметричны относительно оси симметрии двух этих краев.- the directions of the gradients E1 and E2 are symmetrical about the axis of symmetry of these two edges.

Исходные краевые сегменты выделяются с помощью преобразования Хафа путем прослеживания границ и последующей аппроксимации прямыми или каким-либо другим подходящим алгоритмом. Для каждой пары отрезков ([d1,θ1], [d2,θ2]), удовлетворяющей перечисленным условиям, четверка элементов (d1,θ1], [d2,θ2], [u,v], α) добавляется в множество кандидатов на углы (где α - это величина угла). Сформированное множество угловых характерных признаков применяется в различных целях, например для построения каких-либо высокоуровневых описаний.The original edge segments are extracted using the Hough transform by tracing the boundaries and then approximating them by straight lines or some other suitable algorithm. For each pair of segments ([d1, θ1], [d2, θ2]) satisfying the above conditions, the four elements (d1, θ1], [d2, θ2], [u, v], α) are added to the set of candidates for angles (where α is the value of the angle). The formed set of angular characteristic features is used for various purposes, for example, for constructing any high-level descriptions.

Аккумуляторный массив А используется в алгоритме Хафа для проверки каждого пиксела изображения и его окрестности. Определяется, присутствует ли в данном пикселе достаточно выраженный край. Если присутствует, то вычисляются параметры искомой кривой, проходящей через данный пиксел. После оценки параметров прямой в данном пикселе они дискретизируются для получения соответствующих значений М и В и значение элемента массива А[М,В] увеличивается. После обработки пикселов выполняется поиск локальных максимумов в аккумуляторном массиве. Точки локальных максимумов соответствуют параметрам наиболее вероятных прямых на изображении.Battery array A is used in the Hough algorithm to check each pixel in the image and its surroundings. Determines whether a sufficiently pronounced edge is present in a given pixel. If present, then the parameters of the desired curve passing through the given pixel are calculated. After evaluating the parameters of the line in this pixel, they are sampled to obtain the corresponding values of M and B, and the value of the array element A [M, B] increases. After processing the pixels, local maxima are searched in the battery array. The points of local maxima correspond to the parameters of the most probable lines in the image.

- приведенные условия определяют только углы типа 'L';- the above conditions determine only angles of type 'L';

- второе условие исключает из рассмотрения углы типа 'T', 'X' и 'Y';- the second condition excludes from consideration the angles of type 'T', 'X' and 'Y';

- аккумуляторный массив позволяет определить параметры бесконечно протяженных прямых или кривых линий, но с его помощью нельзя определить, где именно на изображении начинаются и заканчиваются отрезки линий.- the battery array allows you to determine the parameters of the infinitely long straight or curved lines, but with its help it is impossible to determine where in the image the line segments begin and end.

Известен способ Blob обнаружения на основе Лапласиан Гауссиан [Lindeberg Т. (1998). "Feature detection with automatic scale selection". International Journal of Computer Vision 30 (2): pp.77-116], который основан на выполнении одноразовой цифровой операции полосовой фильтрации пространственной функции изображения с использованием оператора лапласиана гауссиана.A known method of Blob detection based on Laplacian Gaussian [Lindeberg T. (1998). "Feature detection with automatic scale selection". International Journal of Computer Vision 30 (2): pp.77-116], which is based on a one-time digital operation of band-pass filtering of the spatial function of an image using the Laplacian Gaussian operator.

Данный способ использует свойство коммутативности линейных операторов Гаусса и Лапласа:This method uses the commutativity property of the linear Gauss and Laplace operators:

$L \times (G \times I) = (L \times G) \times I = L o C \times I$

,

L \times (G \times I) = (L \times G) \times I = L o C \times I

,

где × - оператор свертки и возможности объединения их в один путем изменения порядка выполнения операций дифференцирования и сглаживания. Объединенный оператор получил название оператора лапласиана гауссиана LoG (LoG - Laplacian of Gaussian) и представляет собой свертку операторов Лапласа и Гаусса.where × is the convolution operator and the possibility of combining them into one by changing the order of differentiation and smoothing operations. The combined operator is called the Laplacian Gaussian operator LoG (LoG - Laplacian of Gaussian) and is a convolution of the Laplace and Gaussian operators.

- детектирование локальных особенностей в виде областей.- detection of local features in the form of regions.

- операторы Гаусса и Лапласа являются ненаправленными;- Gauss and Laplace operators are non-directional;

- чувствительность к изменению яркости в параллельном направлении, что уменьшает соотношение сигнал/шум.- sensitivity to brightness changes in the parallel direction, which reduces the signal-to-noise ratio.

Известен способ детектирования краев методом Кирша [Шапиро Л., Дж. Стокман. Компьютерное зрение; Пер с англ. - М.: БИНОМ. Лаборатория знаний, 2006. - С.413]. Оператор Кирша - нелинейный детектор края, который находит максимальную толщину края в нескольких направлениях. Взяв маску, оператор вращает ее в 8 основных направлениях компаса: север, северо-запад, запад, юго-запад, юг, юго-восток, восток, северо-восток.A known method for detecting edges by the Kirsch method [Shapiro L., J. Stockman. Computer vision; Per from English. - M .: BINOM. Laboratory of Knowledge, 2006. - S. 413]. The Kirsch operator is a nonlinear edge detector that finds the maximum edge thickness in several directions. Taking the mask, the operator rotates it in 8 main directions of the compass: north, northwest, west, southwest, south, southeast, east, northeast.

Метод Кирша работает с двумерной апертурой 3×3 следующего вида:The Kirsch method works with a two-dimensional 3 × 3 aperture of the following form:

А0A0 A1A1 A2A2 А7A7 FF A3A3 А6A6 A5A5 A4A4

Сначала находятся все значения переменных S_i и T_i, где i изменяется от 0 до 7, производится вычисление двух сумм, одна из которых содержит 3 элемента, последовательно выбранные при обходе «по кругу» крайних элементов, вошедших в окно, вторая сумма включает оставшиеся пять элементов:First, all the values of the variables S _i and T _{i are found} , where i varies from 0 to 7, two sums are calculated, one of which contains 3 elements, sequentially selected during the round-trip of the extreme elements included in the window, the second sum includes the remaining five elements:

$S_{i} = \sum_{j = i}^{(i + 2) \mod 8} A_{j}$

S_{i} = \sum_{j = i}^{(i + 2) \mod 8} A_{j}

$T_{i} = \sum_{j = i + 3}^{(i + 7) \mod 8} A_{j}$

T_{i} = \sum_{j = i + 3}^{(i + 7) \mod 8} A_{j}

Далее значения сумм подвергаются сравнению при помощи взвешенной разности по модулю, что определяет наличие либо отсутствие контраста в рассматриваемом направлении:Further, the values of the sums are compared using the weighted difference modulo, which determines the presence or absence of contrast in the considered direction:

$Y_{i} = | 5 S_{i} - 3 T_{i} |$

Y_{i} = | | | 5 S_{i} - 3 T_{i} | | |

Выбирается максимальный перепад контраста из всех направлений, направление изменяется за счет вычисления всех возможных разностей, последовательно меняя стартовую позицию начального элемента (i):The maximum contrast difference is selected from all directions, the direction is changed by calculating all possible differences, sequentially changing the starting position of the initial element (i):

$F' = \underset{i = 0,7}{\max {Y_{i}}}$

F'' = \underset{i = 0.7}{\max {Y_{i}}}

В точку со значением F записывается полученное значение, фильтр сдвигается вправо.The resulting value is written to the point with the value F, the filter shifts to the right.

Признаки устройства-аналога, совпадающие с признаками заявляемого технического решения, следующие:The features of the analog device, matching the features of the claimed technical solution, are as follows:

- детектирование локальных особенностей в виде границ.- detection of local features in the form of boundaries.

- требуется большое количество времени для расчета локальных особенностей.- A large amount of time is required to calculate local features.

Известен способ детектирования точек методом SIFT (Scale Invariant Feature Transform) [David G. Lowe «Distinctive image features from scale-invariant keypoints» International Journal of Computer Vision, 60, 2(2004), pp.91-110]. Главным этапом в детектировании особых точек является построение пирамиды гауссианов (Gaussian) и разностей гауссианов (Difference of Gaussian, DoG).A known method for detecting points using the SIFT method (Scale Invariant Feature Transform) [David G. Lowe "Distinctive image features from scale-invariant keypoints" International Journal of Computer Vision, 60, 2 (2004), pp. 91-110]. The main step in the detection of singular points is the construction of a pyramid of Gaussians (Differences of Gaussian, DoG).

Способ осуществляется следующим образом: точка считается особой, если она является локальным экстремумом разности гауссианов. Для поиска экстремумов будем использовать следующий метод. В каждом изображении из пирамиды DoG ищутся точки локального экстремума. Каждая точка текущего изображения DoG сравнивается с ее восьмью соседями и с девятью соседями в DoG, находящимися на уровень выше и ниже в пирамиде. Если эта точка больше (меньше) всех соседей, то она принимается за точку локального экстремума.The method is as follows: a point is considered singular if it is a local extremum of the difference of the Gaussians. To search for extrema we will use the following method. In each image from the DoG pyramid, local extremum points are searched. Each point of the current DoG image is compared with its eight neighbors and with nine neighbors in the DoG, located one level higher and lower in the pyramid. If this point is larger (less) than all neighbors, then it is taken as a point of local extremum.

Следующим шагом будет пара проверок пригодности точки экстремума на роль ключевой. Первым делом определяются координаты особой точки с субпиксельной точностью. Это достигается с помощью аппроксимирования функции DoG многочленом Тейлора второго порядка, взятого в точке вычисленного экстремума.The next step will be a couple of checks on the suitability of the extremum point for the key role. First of all, the coordinates of the singular point are determined with subpixel accuracy. This is achieved by approximating the DoG function by the second-order Taylor polynomial taken at the point of the calculated extremum.

$D (x) = D + \frac{\partial D^{T}}{\partial x} x + \frac{1}{2} x^{T} \frac{\partial^{2} D}{\partial x^{2}} x$

,

D (x) = D + \frac{\partial D^{T}}{\partial x} x + \frac{one}{2} x^{T} \frac{\partial^{2} D}{\partial x^{2}} x

,

здесь D - функция DoG, X=(x,y,sigma) - вектор смещения относительно точки разложения, первая производная DoG - градиент, вторая производная DoG - матрица Гессе.here D is the DoG function, X = (x, y, sigma) is the displacement vector with respect to the decomposition point, the first DoG derivative is the gradient, the second DoG derivative is the Hessian matrix.

Экстремум многочлена Тейлора находится путем вычисления производной и приравнивания ее к нулю. В итоге получим смещение точки вычисленного экстремума относительно точногоThe extremum of the Taylor polynomial is found by calculating the derivative and equating it to zero. As a result, we obtain the displacement of the point of the calculated extremum relative to the exact

$\hat{x} = - \frac{\partial^{2} D^{- 1}}{\partial x^{2}} \frac{\partial D}{\partial x}$

,

\hat{x} = - \frac{\partial^{2} D^{- one}}{\partial x^{2}} \frac{\partial D}{\partial x}

,

Все производные вычисляются по формулам конечных разностей. Если одна из компонент вектора $\hat{X}$

больше 0.5*шаг сетки в этом направлении, то это означает, что на самом деле точка экстремума была вычислена неверно и нужно сдвинуться к соседней точке в направлении указанных компонент. Для соседней точки все повторяется заново. Если таким образом выходим за пределы октавы, то следует исключить данную точку из рассмотрения.All derivatives are calculated by finite difference formulas. If one of the components of the vector

\hat{X}

more than 0.5 * grid step in this direction, this means that in fact the extremum point was calculated incorrectly and you need to move to an adjacent point in the direction of the indicated components. For a neighboring point, everything repeats again. If in this way we go beyond the octave, then this point should be excluded from consideration.

Когда положение точки экстремума вычислено, то значение DoG сравнивается с порогом в точке по формуле:When the position of the extremum point is calculated, the DoG value is compared with the threshold at the point by the formula:

$D (\hat{x}) = D + \frac{1}{2} \frac{\partial D^{T}}{\partial x} \hat{x}$

D (\hat{x}) = D + \frac{one}{2} \frac{\partial D^{T}}{\partial x} \hat{x}

Если эта проверка не проходит, то точка исключается из рассмотрения, как точка с малым контрастом.If this check fails, then the point is excluded from consideration, as a point with low contrast.

- не работает с отражающими поверхностями.- does not work with reflective surfaces.

Наиболее близким к изобретению относится способ SURF (Speeded Up Robust Features) [Bay H., Tuytelaars T. (2006) Ess A. Speeded-Up Robust Features (SURF) in LNCS 3951 (ECCV'06), vol. 1, 2006, pp.404-417]. Способ SURF позволяет находить локальные точки с помощью матрицы Гессе и хорошо детектирует пятна и углы. Гессиан инвариантен относительно вращения, но не инвариантен масштабу. Поэтому метод SURF использует разномасштабные фильтры для нахождения гессианов.Closest to the invention relates to a method of SURF (Speeded Up Robust Features) [Bay H., Tuytelaars T. (2006) Ess A. Speeded-Up Robust Features (SURF) in LNCS 3951 (ECCV'06), vol. 1, 2006, pp. 404-417]. The SURF method allows you to find local points using the Hessian matrix and well detects spots and angles. Hessian is rotation invariant, but not scale invariant. Therefore, the SURF method uses multiscale filters to find the Hessians.

Для каждой локальной точки считается направление максимального изменения яркости (градиент) и масштаб, взятый из масштабного коэффициента матрицы Гессе. Градиент в точке вычисляется с помощью фильтров Хаара. Обнаружение особых локальных точек в методе SURF использует бинаризированную аппроксимацию лаплассиана-гауссиана.For each local point, the direction of the maximum brightness change (gradient) and the scale taken from the scale factor of the Hessian matrix are considered. The gradient at the point is calculated using Haar filters. The detection of singular local points in the SURF method uses a binarized approximation of the Laplacian-Gaussian.

Признаки способа-прототипа, совпадающие с признаками заявляемого технического решения, следующие:The features of the prototype method, which coincides with the features of the proposed technical solution, are as follows:

- детектирование локальных особенностей в виде областей и углов.- detection of local features in the form of areas and angles.

- частично инвариантен к освещению и положению оптических камер;- partially invariant to lighting and position of optical cameras;

- не работает с отражающими поверхностями и сильно выраженной геометрической структурой.- Does not work with reflective surfaces and a pronounced geometric structure.

Наиболее близким к изобретению является способ и система для обнаружения лиц [Патент №2008145913/28, МПК G06K 9/00]. Рассматриваемое устройство-прототип предполагает выполнение следующих операций:Closest to the invention is a method and system for detecting faces [Patent No. 2008145913/28, IPC G06K 9/00]. Consider a prototype device involves the following operations:

1) Выполняется построение карт градиентов входящего изображения, результатом работы которого является вычисление изменений интенсивности (или цветных компонент) соседних пикселей или групп и оценка направления наибольшего изменения в каждой точке изображения;1) The mapping of gradients of the input image is performed, the result of which is the calculation of changes in the intensity (or color components) of neighboring pixels or groups and estimates the direction of the largest change at each point in the image;

2) Проводится поиск дуг окружностей на картах градиентов, основным результатом работы которого является получение многомерной весовой карты возможных местоположений лица с учетом геометрических параметров дуг: радиусов, длины и расположения;2) A search is made for circular arcs on gradient maps, the main result of which is to obtain a multidimensional weight map of possible locations of the face, taking into account the geometric parameters of the arcs: radii, length and location;

3) Проводится поиск локальных особенностей на входящем изображении и градиентных картах, результатом работы будет являться набор выделенных локальных особенностей с их геометрическими характеристиками;3) A search is made for local features in the incoming image and gradient maps, the result of the work will be a set of selected local features with their geometric characteristics;

4) Выполняется построение набора гипотез возможных положений лиц, выходной информацией служит набор гипотез, каждая из которых определяет возможное положение лица на исходном изображении, с учетом его размеров и ориентации;4) The construction of a set of hypotheses of possible positions of persons is carried out, the output information is a set of hypotheses, each of which determines the possible position of a person in the original image, taking into account its size and orientation;

5) Осуществляется верификация построенных гипотез.5) Verification of the constructed hypotheses is carried out.

Устройство для обнаружения лиц включает: блок регистрации (памяти) входящего изображения; блок построения карт градиентов входящего изображения; блок поиска дуг окружностей на картах градиентов; блок поиска локальных особенностей на входящем изображении и градиентных картах; блок построения набора гипотез возможных положений лиц; блок предобработки; блок-детектор лица; блок верификации построения гипотез.A device for detecting faces includes: an input image registration (memory) unit; block for constructing gradient maps of the incoming image; block search for circular arcs on gradient maps; block search for local features in the incoming image and gradient maps; a block for constructing a set of hypotheses of possible positions of persons; preprocessing unit; block face detector; hypothesis verification block.

Недостатками известного устройства-прототипа являются:The disadvantages of the known prototype device are:

- детектирование определенных локальных особенностей с целью обнаружения местоположения лица на изображении.- detecting certain local features in order to detect the location of the face in the image.

- в качестве детектирования локальных особенностей может выступать метод SURF, соответственно такой метод обнаруживает только локальные особенности в виде областей.- the SURF method can act as a detection of local features, respectively, such a method detects only local features in the form of regions.

Суть предлагаемого способа детектирования локальных особенностей на изображении состоит в следующем (фиг.1).The essence of the proposed method for detecting local features in the image is as follows (figure 1).

На первом шаге исходное изображение, которое представлено моделью в цветовом пространстве RGB, преобразуется в градации серого для реализации метода поиска локальных особенностей.In the first step, the original image, which is represented by the model in the RGB color space, is converted to grayscale to implement a method for searching for local features.

На втором шаге происходит выбор интересующей локальной особенности с помощью обработки текстурными энергетическими характеристиками Лаве (Laws) [Laws К. Textured image segmentation, Ph.D. dissertation, University of Southern California. 130 pp. - 1980].At the second step, a local feature of interest is selected by processing the Laws textural energy characteristics [Laws K. Textured image segmentation, Ph.D. dissertation, University of Southern California. 130 pp. - 1980].

Для вычисления энергетических характеристик используется набор из 22 масок размерами 5x5. Затем энергетические характеристики каждого пиксела анализируемого изображения представляются в виде вектора и 9 чисел. Для вычисления масок используются следующие векторы:To calculate the energy characteristics, a set of 22 masks with dimensions 5x5 is used. Then the energy characteristics of each pixel of the analyzed image are presented in the form of a vector and 9 numbers. The following vectors are used to calculate masks:

$L 5 (L e v e l) = [\begin{matrix} 1 & 4 & 6 & 4 & 1 \end{matrix}]$

L 5 (L e v e l) = [\begin{matrix} one & four & 6 & four & one \end{matrix}]

$E 5 (E d g e) = [\begin{matrix} - 1 & - 2 & 0 & 2 & 1 \end{matrix}]$

E 5 (E d g e) = [\begin{matrix} - one & - 2 & 0 & 2 & one \end{matrix}]

$S 5 (S p o t) = [\begin{matrix} - 1 & 0 & 2 & 0 & - 1 \end{matrix}]$

S 5 (S p o t) = [\begin{matrix} - one & 0 & 2 & 0 & - one \end{matrix}]

$E 5 (R i p p l e) = [\begin{matrix} 1 & - 4 & 6 & - 4 & 1 \end{matrix}]$

E 5 (R i p p l e) = [\begin{matrix} one & - four & 6 & - four & one \end{matrix}]

$E 5 (W a v e) = [\begin{matrix} - 1 & 2 & 0 & - 2 & 1 \end{matrix}]$

E 5 (W a v e) = [\begin{matrix} - one & 2 & 0 & - 2 & one \end{matrix}]

Название вектора описывает их назначение. Вектор L5 предназначен для вычисления симметричного взвешенного локального среднего значения. Вектор Е5 предназначен для обнаружения краев, S5 - для обнаружения пятен, R5 - для обнаружения образа в виде ряби, a W5 - для обнаружения волн. Двумерные маски вычисляются умножением пар векторов. Например, для получения маски E5L5 надо умножить вектор Е5 на L5:The name of the vector describes their purpose. Vector L5 is designed to calculate a symmetric weighted local mean value. Vector E5 is for detecting edges, S5 is for detecting spots, R5 is for detecting an image in the form of ripples, and W5 is for detecting waves. Two-dimensional masks are calculated by multiplying pairs of vectors. For example, to get the mask E5L5, you need to multiply the vector E5 by L5:

$[\begin{matrix} - 1 \\ - 2 \\ 0 \\ 2 \\ 1 \end{matrix}] \times [\begin{matrix} 1 & 4 & 6 & 4 & 1 \end{matrix}] = [\begin{matrix} - 1 & - 4 & - 6 & - 4 & 1 \\ - 2 & - 8 & - 12 & - 8 & - 2 \\ 0 & 0 & 0 & 0 & 0 \\ 2 & 8 & 12 & 8 & 2 \\ 1 & 4 & 6 & 4 & 1 \end{matrix}]$

[\begin{matrix} - one \\ - 2 \\ 0 \\ 2 \\ one \end{matrix}] \times [\begin{matrix} one & four & 6 & four & one \end{matrix}] = [\begin{matrix} - one & - four & - 6 & - four & one \\ - 2 & - 8 & - 12 & - 8 & - 2 \\ 0 & 0 & 0 & 0 & 0 \\ 2 & 8 & 12 & 8 & 2 \\ one & four & 6 & four & one \end{matrix}]

Каждая текстурная энергетическая карта является полноразмерным изображением, которое представляет результаты обработки входного изображения с использованием k-й маски.Each texture energy map is a full-sized image that represents the results of processing the input image using the kth mask.

После получения 22 энергетических карт некоторые симметричные пары комбинируются и в результате строятся 13 окончательных карт. Каждая симметричная пара карт заменяется усредненной картой. Например, маска E5L5 характеризует содержание горизонтальных краев, L5E5 - вертикальных. Среднее двух этих карт будет характеризовать наличие на изображении краев обоих этих типов. Перечислим 13 окончательных энергетических карт: L5E5/E5L5, L5S5/S5L5, W5W5, L5W5/W5L5, L5R5/R5L5, Е5Е5, ESWSIWSES, E5S5/S5E5, E5R5/R5E5, RSWSIW5RS, S5S5, S5R5/R5S5, R5R5.After receiving 22 energy cards, some symmetric pairs are combined and as a result 13 final cards are built. Each symmetric pair of cards is replaced by an average card. For example, the mask E5L5 characterizes the content of horizontal edges, Leller - vertical. The average of these two cards will characterize the presence on the image of the edges of both of these types. We will list 13 final energy cards: Leller / E5L5, L5S5 / S5L5, W5W5, L5W5 / W5L5, L5R5 / R5L5, E5Е5, ESWSIWSES, E5S5 / Seller, E5R5 / Raser, RSWSIW5RS, S5S5, S5R5555S5R55555S5R5555S5R55S5R55S5R55S5R55S5R55S5R55S5R55S5R55S5R5555S5R555S5R5555S5R5555S5R55S5R55555S5R55S5R555555

Таким образом, используя фильтры Лавса, для любого пикселя получаем описание характеристик текстуры в некоторой окрестности выбранного пикселя.Thus, using Loves filters, for any pixel we get a description of the texture characteristics in a certain neighborhood of the selected pixel.

На третьем шаге полученное изображение обрабатывается методом SURF. Метод SURF позволяет находить локальные точки с помощью матрицы Гессе и хорошо детектирует пятна, углы и края линий. Гессиан инвариантен относительно вращения, но не инвариантен масштабу. Поэтому метод SURF использует разномасштабные фильтры для нахождения гессианов.In the third step, the resulting image is processed by the SURF method. The SURF method allows you to find local points using the Hessian matrix and well detects spots, corners and edges of lines. Hessian is rotation invariant, but not scale invariant. Therefore, the SURF method uses multiscale filters to find the Hessians.

Для каждой локальной точки считается направление максимального изменения яркости (градиент) и масштаб, взятый из масштабного коэффициента матрицы Гессе. Градиент в точке вычисляется с помощью фильтров Хаара.For each local point, the direction of the maximum brightness change (gradient) and the scale taken from the scale factor of the Hessian matrix are considered. The gradient at the point is calculated using Haar filters.

Для эффективного вычисления фильтров Гессе и Хаара используется интегральное представление изображений.To efficiently calculate the Hessian and Haar filters, the integral representation of the images is used.

Интегральное представление является матрицей, размерность которой совпадает с размерностью исходного изображения, а элементы считаются по формуле:The integral representation is a matrix, the dimension of which coincides with the dimension of the original image, and the elements are calculated by the formula:

$I I (x, y) = \sum_{i = 0, y = 0}^{i \leq x, j \leq y} I (x, y)$

I I (x, y) = \sum_{i = 0 y = 0}^{i \leq x, j \leq y} I (x, y)

где I(i,j) - яркость пикселов исходного изображения.where I (i, j) is the brightness of the pixels of the original image.

Имея интегральную матрицу, можно очень быстро вычислять сумму яркостей пикселов произвольных прямоугольных областей изображения по формуле:Having an integrated matrix, you can very quickly calculate the sum of the brightness of the pixels of arbitrary rectangular areas of the image by the formula:

$S u m O f Re c t = I I (A) + I I (C) - I I (B) - I I (D)$

S u m O f Re c t = I I (A) + I I (C) - I I (B) - I I (D)

где ABCD - интересующий прямоугольник.where ABCD is the rectangle of interest.

Обнаружение особых локальных точек в методе SURF использует бинаризированную аппроксимацию лапласиана гауссиана:Detection of singular local points in the SURF method uses a binarized approximation of the Gaussian Laplacian:

$\det (H_{a p p r o s}) = D_{x x} D_{y y} - {(0.9 D_{x y})}^{2}$

.

\det (H_{a p p r o s}) = D_{x x} D_{y y} - {(0.9 D_{x y})}^{2}

.

Предлагаемый способ позволяет детектировать различные типы локальных особенностей за счет предварительной обработки энергетическими характеристиками Лавса.The proposed method allows to detect various types of local features due to preliminary processing by the energy characteristics of Loves.

Устройство детектирования локальных особенностей на изображении (фиг.2) содержит 2D камеру 1, выход которой подключен к входу блока регистрации изображения 2, выход которого подключен к следующим входам: входу блока построения карт градиентов 3, выход которого подключен к входу блока свертки изображения с ядром обнаружения краев 4.1, к входу блока свертки изображения с ядром обнаружения образа в виде ряби 4.2, к входу блока свертки изображения с ядром обнаружения пятен 4.3, к входу блока свертки изображения с ядром обнаружения волн 4.4, выходы которых подключены к входам блоков поиска локальных особенностей 5.1, 5.2, 5.3, 5.4 соответственно в каждом канале, выходы которых подключены к входам блоков хранения 6.1, 6.2, 6.3, 6.4; синхронность работы устройства обеспечивается генератором тактовых импульсов 7.The device for detecting local features in the image (Fig. 2) contains a 2D camera 1, the output of which is connected to the input of the image registration unit 2, the output of which is connected to the following inputs: the input of the gradient mapping unit 3, the output of which is connected to the input of the image convolution unit with the core edge detection 4.1, to the input of the image convolution unit with the image detection core in the form of ripples 4.2, to the input of the image convolution unit with the spots detection kernel 4.3, to the input of the image convolution unit with the waves detection kernel 4.4, outputs of which ryhs are connected to the inputs of the search blocks of local features 5.1, 5.2, 5.3, 5.4, respectively, in each channel, the outputs of which are connected to the inputs of the storage blocks 6.1, 6.2, 6.3, 6.4; the synchronization of the device is provided by the clock generator 7.

Устройство детектирования локальных особенностей на изображении реализуется следующим образом. Изображение с камеры поступает в блок регистрации изображения, который выполняет роль памяти, далее изображение преобразуется в градации серого в блоке построения карт градиентов и поступает параллельно в четыре канала, в каждом из которых матрица изображения сворачивается с различными ядрами, которые характеризуют текстурные признаки изображения. Полученные текстурные энергетические карты, в свою очередь, поступают параллельно на входы блоков поиска локальных особенностей, в которых происходит поиск локальных особенностей изображения способом детектирования SURF. Результатом работы блоков будет являться набор выделенных локальных особенностей с их геометрическими характеристиками.A device for detecting local features in an image is implemented as follows. The image from the camera enters the image registration unit, which acts as a memory, then the image is converted to grayscale in the gradient mapping unit and enters in parallel into four channels, in each of which the image matrix is collapsed with different cores that characterize the texture features of the image. The resulting texture energy maps, in turn, are sent in parallel to the inputs of the local feature search blocks, in which the local image features are searched for using the SURF detection method. The result of the blocks will be a set of distinguished local features with their geometric characteristics.

Устройство детектирования локальных особенностей на изображении работает следующим образом. Изображение с 2D камеры 1 передается на блок регистрации изображения 2, из которого изображение поступает в блок построения карт градиентов 3, результатом работы в котором является изменение цветного изображения в градации серого. Полученное изображение в градациях серого параллельно поступает на: вход блока 4.1, результатом которого является текстурная энергетическая карта свертки изображения с ядром обнаружения краев; на вход блока 4,2, результатом которого является текстурная энергетическая карта свертки изображения с ядром обнаружения образа в виде ряби; на вход блока 4.3, результатом которого является текстурная энергетическая карта свертки изображения с ядром обнаружения пятен; на вход блока 4.4, результатом которого является текстурная энергетическая карта свертки изображения с ядром обнаружения волн. Полученные текстурные энергетические карты, в свою очередь, поступают параллельно на входы блоков поиска локальных особенностей 5.1, 5.2, 5.3, 5.4 соответственно, в которых происходит поиск локальных особенностей изображения способом детектирования SURF. Набор выделенных локальных особенностей с различными текстурными и геометрическими характеристиками хранится в блоках 6.1, 6.2, 6.3, 6.4.A device for detecting local features in the image works as follows. The image from the 2D camera 1 is transmitted to the image registration unit 2, from which the image enters the gradient mapping block 3, the result of which is a change in the color image in grayscale. The resulting image in grayscale is simultaneously sent to: the input of block 4.1, the result of which is a texture energy map of convolution of the image with the edge detection core; to the input of block 4.2, the result of which is a texture energy map of convolution of the image with the image detection core in the form of ripples; to the input of block 4.3, the result of which is a texture energy map of convolution of the image with the spot detection core; to the input of block 4.4, the result of which is a texture energy map of convolution of the image with the wave detection core. The resulting texture energy maps, in turn, are sent simultaneously to the inputs of the local feature search blocks 5.1, 5.2, 5.3, 5.4, respectively, in which the local features of the image are searched for using the SURF detection method. A set of distinguished local features with different texture and geometric characteristics is stored in blocks 6.1, 6.2, 6.3, 6.4.

Технический результат - детектирование локальных особенностей на изображении с определенными текстурными свойствами.The technical result is the detection of local features in the image with certain texture properties.

Claims

1. A method for detecting local features in an image, which consists in searching for local features using the SURF method, characterized in that the resulting image in the RGB color space is converted to grayscale, and then processed by the Loves energy characteristics using convolution of the source image matrix with detection type kernels image in the form of ripples, spot detection, wave detection, and edge detection, for each received image, local features such as spots are detected, ly, SURF method of finding local line edge point using the Hessian matrix.

2. A device for detecting local features in an image containing a 2D camera, the output of which is connected to the input of the image registration unit, the output of which is connected to the input of the gradient mapping block; characterized in that the device contains four image processing channels by convolution of the matrix of the specified image, the output of the gradient mapping block is connected to the inputs of the feature blocks characterizing the convolution of the image matrix with the cores whose outputs are connected to the inputs of the local feature search blocks in each channel, respectively, the outputs of which connected to the inputs of the storage units, synchronization of all units of the device is carried out by a clock generator.