RU2174710C1

RU2174710C1 - Method for automatic enhancing of half-tone image

Info

Publication number: RU2174710C1
Application number: RU2000106572A
Authority: RU
Inventors: А.Н. Аниконов; С.П. Белоконь; А.Е. Мадонов; В.Г. Ковалев
Original assignee: Общевойсковая Академия Вооруженных Сил Российской Федерации
Priority date: 2000-03-20
Filing date: 2000-03-20
Publication date: 2001-10-10

Abstract

FIELD: digital procession of images. SUBSTANCE: method is based upon determination of unimodal or bimodal type of initial brightness bar chart and threshold level of brightness having property of dividing bimodal bar chart by two unimodal fragments; subsequent processing of bar chart fragments and reverse transition from bar chart fragments to image segments. Method comprises steps of automatic approximation of stepped initial brightness bar chart by means of approximation curve; plotting curve of dynamics of bar chart center; detecting brightness intervals for which brightness value of approximation curve is less than respective brightness values for dynamics curve of bar chart center or equal to them; calculating weight of division zone for each brightness interval; identifying brightness interval of division zone with maximum weight; comparing it with standard one; deciding that initial brightness bar chart is bimodal one when first of said values is more than second one; detecting division zone; equalizing remaining fragments with standard distribution and filtering them. EFFECT: improved reliability and noise stability of image enhancing process without operator. 4 dwg

Description

Изобретение относится к способам цифровой обработки изображений. The invention relates to digital image processing methods.

В настоящее время в процедуре улучшения изображений с целью приведения его к виду, удобному для восприятия или машинного анализа, могут применяться различные способы цифровой обработки изображений. Currently, in the procedure for improving images in order to bring it to a form convenient for perception or machine analysis, various methods of digital image processing can be applied.

Известен способ - сегментация (см., например, Прэтт У. Цифровая обработка изображений: Пер. с англ. - М: Мир, 1982. - Кн.2. - С. 555-570; Фу К. Структурные методы в распознавании образов: Пер с англ. - М.: Мир, 1977. - С. 20-25; патент США N 4574393 (G 06 K 9/00, 04.03.1986; всего 25 с.)). Суть последнего заключается в определении унимодального или бимодального типа исходной яркостной гистрограммы и порогового уровня яркости, обладающего свойством разделения бимодальной гистограммы на два унимодальных фрагмента, с последующей обработкой фрагментов гистограммы, а также в обратном переходе от фрагментов гистограммы к сегментам изображения. A known method is segmentation (see, for example, Pratt W. Digital image processing: Translated from English. - M: Mir, 1982. - Book 2. - P. 555-570; Fu K. Structural methods in pattern recognition: Translated from English - M .: Mir, 1977. - S. 20-25; U.S. Patent N 4,574,393 (G 06 K 9/00, 03/04/1986; 25 p. Total)). The essence of the latter is to determine the unimodal or bimodal type of the initial luminance histogram and the threshold brightness level, which has the property of dividing the bimodal histogram into two unimodal fragments, with subsequent processing of the histogram fragments, as well as the reverse transition from histogram fragments to image segments.

Существует также способ автоматического улучшения полутонового изображения, в котором фрагменты полутонового изображения эквализируют под нормальное распределение под управлением бинарной маски и проводят их высокочастотную фильтрацию (патент США N 5854851, G 06 K 9/00, 29.12.1998; всего 35 с.: см. столбцы 24-26 и фиг. 8D). There is also a method for automatically improving a grayscale image in which fragments of a grayscale image are equalized to a normal distribution under the control of a binary mask and carry out their high-pass filtering (US patent N 5854851, G 06 K 9/00, 12/29/1998; only 35 pp .: see columns 24-26 and Fig. 8D).

Недостатком данных способов улучшения изображения является остающееся негативное влияние отдельных сегментов на результаты обработки изображения в целом и отсутствие гарантированного качества результатов, в значительной степени зависящих от содержания изображения и цели наблюдателя, свидетельствующие о том, что гарантированное улучшение невозможно без реализации аппарата понимания наблюдаемой сцены в аспекте решаемой наблюдателем задачи. The disadvantage of these image enhancement methods is the remaining negative impact of individual segments on the image processing results as a whole and the lack of guaranteed quality of the results, which largely depend on the image content and the observer’s goals, indicating that guaranteed improvement is impossible without the implementation of an apparatus for understanding the observed scene in aspect problem solved by the observer.

Задачей настоящего изобретения является повышение надежности и помехоустойчивости улучшения изображения и исключение из процесса улучшения человека. The objective of the present invention is to increase the reliability and noise immunity of image enhancement and the exclusion from the process of human improvement.

Указанная задача решается тем, что с исходным полутоновым изображением последовательно производятся автоматическая сегментация по форме яркостной гистограммы, эквализация под нормальное распределение каждого из сегментов исходного изображения под управлением бинарной маски и высокочастотная фильтрация эквализированного изображения под управлением бинарной маски. This problem is solved by the fact that with the initial grayscale image, automatic segmentation is sequentially performed according to the shape of the brightness histogram, equalization to the normal distribution of each of the segments of the original image under the control of the binary mask, and high-pass filtering of the equalized image under the control of the binary mask.

Сущность изобретения заключена в автоматическом улучшении качества визуального восприятия полутонового изображения в результате использования сегментации, нормальной эквализации и высокочастотной фильтрации. При этом весь процесс улучшения полутонового изображения заключается в: аппроксимировании ступенчатой исходной яркостной гистограммы соответствующей аппроксимирующей кривой на основе аппроксимирующего полинома, построении кривой динамики центра гистограммы, определении тех яркостных интервалов, для всех точек которых значения яркости для аппроксимирующей кривой меньше или равны соответствующим значениям яркости для кривой динамики центра гистограммы, вычислении веса области разделения для каждого из найденных яркостных интервалов как суммы разностей, вычисленных на каждом яркостном интервале, между соответствующими значениями ординат кривой динамики центра гистограммы и аппроксимирующей кривой, идентифицировании яркостного интервала, соответствующего области разделения с максимальным весом, сравнения значения максимального веса области разделения и нормативного веса, при превышении первого из этих значений над вторым принимается решение о бимодальном типе исходной яркостной гистрограммы и находится глобальный минимум аппроксимирующего полинома на яркостном интервале с максимальным весом области разделения, который и принимают в качестве упомянутого порогового уровня яркости для обеспечения операции порогового среза исходного изображения управляющей бинарной маской, из которой при обратном переходе от фрагментов гистрограммы к сегментам изображения удаляют шум, разделение (декомпозиция) исходного изображения на определенные части (сегменты), различающиеся по своему содержанию, последующей эквализации под нормальное распределение полученных сегментов под управлением бинарной маски и высокочастотной фильтрации эквализированного изображения под управлением бинарной маски. The essence of the invention lies in the automatic improvement of the quality of visual perception of a grayscale image as a result of the use of segmentation, normal equalization and high-pass filtering. Moreover, the whole process of improving the grayscale image consists in: approximating the stepwise initial brightness histogram of the corresponding approximating curve based on the approximating polynomial, plotting the dynamics curve of the center of the histogram, determining those brightness intervals for all points of which the brightness values for the approximating curve are less than or equal to the corresponding brightness values for the histogram center dynamics curve, calculating the weight of the separation region for each of the found brightness intervals fishing as the sum of the differences calculated on each luminance interval between the corresponding ordinates of the histogram center dynamics curve and the approximating curve, identifying the luminance interval corresponding to the separation region with the maximum weight, comparing the maximum weight of the separation region and the normative weight, if the first of these values is exceeded above the second, a decision is made on the bimodal type of the initial brightness histogram and the global minimum of the approximating polynomial per jar is found the remaining interval with the maximum weight of the separation region, which is taken as the mentioned threshold brightness level to provide the threshold cutoff operation of the original image with a control binary mask, from which, upon the reverse transition from histogram fragments to image segments, noise is removed, separation (decomposition) of the original image into certain parts (segments) that differ in their content, subsequent equalization to the normal distribution of the obtained segments under binary control asci and highpass filtering equalized image under control of the binary mask.

Результатом является получаемое улучшение качества визуального восприятия изображения. The result is the resulting improvement in the quality of visual perception of the image.

Изобретение поясняется чертежом. Способ автоматического улучшения полутонового изображения. На чертеже представлены:
Фото 1 (слева, вверху) - исходное изображение без применения способа автоматического улучшения полутонового изображения, а под ним показана гистрограмма 1, показывающая внешний вид исходной яркостной бимодальной гистограммы.The invention is illustrated in the drawing. A way to automatically improve the grayscale image. The drawing shows:
Photo 1 (top left) is the original image without using the automatic grayscale image enhancement method, and histogram 1 is shown below it, showing the appearance of the initial brightness bimodal histogram.

Фото 2 (справа) - изображение, получаемое при применении способа автоматического улучшения полутонового изображения, а также гистограмма 2, дающая представление о повышении удельного веса высокочастотных составляющих в пространственно-частотном спектре изображения (после применения способа автоматического улучшения полутонового изображения). Photo 2 (right) is the image obtained by applying the method of automatic halftone image enhancement, as well as histogram 2, which gives an idea of increasing the specific gravity of high-frequency components in the spatial frequency spectrum of the image (after applying the method of automatic halftone image enhancement).

Задача разделения (декомпозиции) исходного изображения на определенные части (сегменты) по сходству своих точек осуществляется посредством сегментации (см., например, Прэтт У. Цифровая обработка изображений: Пер. с англ. - М. : Мир, 1982. - Кн. 2. - С. 555-570; Фу К. Структурные методы в распознавании образцов: Пер. с англ. - М.: Мир, 1977. - С. 20-25; Положительное решение по заявке N 98113421/09 от 10.07.98). В ее результате получается разделение (декомпозиция) исходного изображения на определенные части (сегменты), различающиеся по своему содержанию. The task of dividing (decomposing) the original image into specific parts (segments) by the similarity of their points is carried out by means of segmentation (see, for example, Pratt W. Digital image processing: Transl. From English. - M.: Mir, 1982. - Book 2 - S. 555-570; Fu K. Structural methods in pattern recognition: Translated from English - Moscow: Mir, 1977. - P. 20-25; Positive decision on application N 98113421/09 of 07/10/98) . As a result of it, a separation (decomposition) of the original image into certain parts (segments) that differ in their content is obtained.

Для решения задачи исключения необходимо сначала выделить на изображении сегменты "местность" и "небо". Исследования показали, что такую сегментацию можно осуществить посредством визуального анализа внешнего вида исходной яркостной бимодальной гистограммы на два унимодальных фрагмента. Оценка точки (на оси абсцисс), расположенной между модальными образованиями и максимально удаленной от обоих, позволяет назначить яркостный уровень, разделяющий с приемлемой точностью пикселы разных сегментов. Если по этому уровню осуществить пороговый срез исходного изображения, то в результате получим бинарную маску, в которой пикселы сегментов "небо" и "местность" отмаркированы соответственно нулем и единицей. To solve the elimination problem, you must first select the "terrain" and "sky" segments in the image. Studies have shown that such segmentation can be accomplished by visual analysis of the appearance of the initial brightness bimodal histogram into two unimodal fragments. Evaluation of a point (on the abscissa axis) located between the modal formations and as far as possible from both allows you to assign a brightness level that separates pixels of different segments with acceptable accuracy. If a threshold cut of the original image is carried out at this level, then as a result we get a binary mask in which the pixels of the sky and terrain segments are marked with zero and one, respectively.

В случае унимодального исходного яркостного распределения и увеличения яркостного диапазона информативно значимого фрагмента (например, сегмент: "местность"), после исключения из обработки присутствующего в кадре информативно незначимого фрагмента повышенной яркости, соответствующего небесному своду (например, сегмент: "небо"), необходимо воспользоваться преимуществами нормальной эквализации. In the case of a unimodal initial luminance distribution and an increase in the luminance range of an informatively significant fragment (for example, segment: "terrain"), after excluding from the processing an informatively insignificant fragment of increased brightness corresponding to the vault (for example, segment: "sky"), it is necessary Take advantage of normal equalization.

Процедура эквализации (см., например, Прэтт У. Цифровая обработка изображений. - М.: Мир, 1982. - Кн. 2. - С. 555-570; Структурные методы в распознавании образов; Пер. с англ. - М.: Мир, 1977. - С. 20-25) рассматривается как монотонное поэлементное преобразование
b = z(a), (1.1)
где a = [0,255] - исходное значение яркости;
b = [0,255] - выходное значение яркости.The equalization procedure (see, for example, Pratt W. Digital image processing. - M .: Mir, 1982. - Book 2. - P. 555-570; Structural methods in pattern recognition; Transl. From English. - M .: Mir, 1977. - S. 20-25) is considered as a monotonous element-wise transformation
b = z (a), (1.1)
where a = [0.255] is the initial brightness value;
b = [0.255] - output brightness value.

Распределение относительных частот h_a(a) переходит в распределение относительных частот h_b(b).The distribution of relative frequencies h _a (a) goes into the distribution of relative frequencies h _b (b).

Кроме того, вероятность того, что яркость элементов исходного изображения

меньше или равна a, должна равняться вероятности того, что яркость элементов выходного изображения будет меньше или равна b = z(a):

где i, j - уровни яркости элементов соответственно исходного и выходного изображений.In addition, the likelihood that the brightness of the elements of the original image

less than or equal to a, should be equal to the probability that the brightness of the elements of the output image will be less than or equal to b = z (a):

where i, j are the brightness levels of the elements of the source and output images, respectively.

Если дискретное распределение для b заменить плотностью непрерывного, получаем приближенную форму записи для (1.3)

где a - исходное значение яркости,
b - получаемое значение яркости,
h_a(i) - исходная гистограмма,
P_b(b) - плотность требуемого распределения.If the discrete distribution for b is replaced by the density of a continuous one, we obtain an approximate form of writing for (1.3)

where a is the initial brightness value,
b is the resulting brightness value,
h _a (i) is the initial histogram,
P _b (b) is the density of the desired distribution.

В случае эквализации под нормальное распределение имеем

где m и σ - задаваемые параметры требуемого нормального распределения (математическое ожидание и среднее квадратическое отклонение).In the case of equalization to the normal distribution, we have

where m and σ are the specified parameters of the required normal distribution (mathematical expectation and standard deviation).

Тогда:

где erf(τ) = -erf(-τ)- табулированная функция ошибок (Г. Корн. Справочник по математике для научных работников и инженеров. - М.: Наука, 1984, - 831 с.).Then:

where erf (τ) = -erf (-τ) is the tabulated error function (G. Korn. Handbook of mathematics for scientists and engineers. - M .: Nauka, 1984, - 831 p.).

Для каждого значения правой части соотношения (1.6) с помощью таблиц erf можно определить значение аргумента функции ошибок Z_erf(a), при котором выполняется данное равенство. Итоговое выражение, связывающее входные и выходные уровни яркости при нормальной эквализации, имеет вид

В соответствии с правилом "трех сигма" при m = 128 и σ = 42 обеспечивается максимальное использование яркостной чувствительности глаза. В принципе, равномерную эквализацию можно рассматривать как частный случай нормальной при соответствующей регулировке σ.For each value of the right-hand side of relation (1.6), using the tables erf, we can determine the value of the argument of the error function Z _erf (a) at which this equality holds. The final expression connecting the input and output brightness levels at normal equalization has the form

In accordance with the “three sigma” rule, with m = 128 and σ = 42, the maximum use of the brightness sensitivity of the eye is ensured. In principle, uniform equalization can be considered as a special case of normal with appropriate adjustment of σ.

При тестировании высокочастотных фильтров было установлено, что наиболее значительное и стабильное повышение качества изображений наблюдается в результате действия фильтра, дискретная аппроксимация импульсного отклика которого определяется маской

Данную маску можно записать в виде

где K₁ = 1/2.When testing high-frequency filters, it was found that the most significant and stable increase in image quality is observed as a result of the filter, a discrete approximation of the impulse response of which is determined by the mask

This mask can be written as

where K ₁ = 1/2.

Здесь первое слагаемое представляет импульсный отклик, не изменяющий исходных яркостей, а второе - учитываемую с весом K₁ дискретную аппроксимацию лапласиана. Таким образом, повышение качества достигается операцией
G₁(n_x,n_y) = G(n_x,n_y) + K₁ • L₁(n_x,n_y), (1.10)
где n_x, n_y - дискретные пространственные координаты,
G(n_x,n_y) - исходное изображение,
G₁(n_x,n_y) - результат обработки,
L₁(n_x,n_y) - дискретная аппроксимация лапласиана,
K₁ - весовой коэффициент.Here, the first term is the impulse response that does not change the initial brightnesses, and the second is the discrete approximation of the Laplacian taken into account with a weight of K ₁ . Thus, quality improvement is achieved by operation
G ₁ (n _x , n _y ) = G (n _x , n _y ) + K ₁ • L ₁ (n _x , n _y ), (1.10)
where n _x , n _y are discrete spatial coordinates,
G (n _x , n _y ) is the original image,
G ₁ (n _x , n _y ) is the result of processing,
L ₁ (n _x , n _y ) is the discrete approximation of the Laplacian,
K ₁ - weight coefficient.

Т. е. благодаря данной операции повышается удельный вес высокочастотных составляющих в пространственно-частотном спектре изображения, т.к. известно, что оператор Лапласа выделяет высокочастотные перепады яркости без учета их ориентации (см. , например, Прэтт У. Цифровая обработка изображений. - М.: Мир, 1982. - С. 20). Следует отметить, что используемая маска

является оптимальной дискретной аппроксимацией лапласиана на квадратной решетке (см. , например, Хорн Б.К.П. Зрение роботов. - М.: Мир, 1989. - 487 с. ), наиболее предпочтительной для систем цифровой обработки изображений. Применение гексагональной и других видов решеток двумерной дискретизации вызывает значительные трудности при реализации операции свертки и унитарных преобразований. Что касается весового коэффициента K₁, то, как следует из (1.9) и (1.10), операция (1.11) наиболее результативна при K₁=1/2. В принципе, в реальных системах цифровой обработки изображений вес K₁ может быть управляемым параметром. В простейшем случае это ручная настройка значения K₁ оператором, ведущим наблюдение. Также следует отметить, что на значение K₁ оказывает влияние размер зерна разрушения конкретной отображающей системы, т. к. точная запись маски (1.10) содержит коэффициент не 1/6, а 1/6e², где e - расстояние между центрами соседних пикселов.That is, thanks to this operation, the specific gravity of high-frequency components in the spatial-frequency spectrum of the image increases, because it is known that the Laplace operator selects high-frequency differences in brightness without regard to their orientation (see, for example, Pratt W. Digital image processing. - M .: Mir, 1982. - P. 20). It should be noted that the mask used

is the optimal discrete approximation of the Laplacian on a square lattice (see, for example, Horn B.K.P. Vision of robots. - M .: Mir, 1989. - 487 p.), the most preferred for digital image processing systems. The use of hexagonal and other types of gratings of two-dimensional discretization causes considerable difficulties in the implementation of the operation of convolution and unitary transformations. As for the weight coefficient K ₁ , then, as follows from (1.9) and (1.10), operation (1.11) is most effective for K ₁ = 1/2. In principle, in real digital image processing systems, the weight of K ₁ can be a controlled parameter. In the simplest case, this is a manual adjustment of the value of K _{1 by the} operator conducting the observation. It should also be noted that the value of K ₁ is affected by the grain size of the destruction of a particular imaging system, since the exact recording of the mask (1.10) does not contain a factor of 1/6, but 1 / 6e ² , where e is the distance between the centers of neighboring pixels.

Предложенный способ позволяет получить положительный эффект, заключающийся в полной автоматизации процесса улучшения полутонового изображения. The proposed method allows to obtain a positive effect, which consists in the complete automation of the process of improving the grayscale image.

Апробирование этого способа показало его высокую эффективность при обработке изображений объектов на естественных фонах. Testing of this method has shown its high efficiency in processing images of objects on natural backgrounds.

Claims

A method for automatically improving a grayscale image, which consists in determining the unimodal or bimodal type of the initial luminance histogram and a threshold brightness level having the property of dividing the bimodal histogram into two unimodal fragments, with subsequent processing of the histogram fragments, as well as the inverse transition from the histogram fragments to image segments, which differs the fact that in the process of improving the image, the stepwise initial brightness luminance gis is automatically approximated ograms of the corresponding approximating curve based on the approximating polynomial, plotting the dynamics curve of the center of the histogram, determining those brightness intervals for all points of which the brightness values for the approximating curve are less than or equal to the corresponding brightness values for the dynamics curve of the center of the histogram, calculating the weight of the separation region for each of the found brightness intervals as the sum of the differences calculated on each brightness interval between the corresponding ordinates of the dynamics curve histogram and approximating curve, identifying the brightness interval corresponding to the separation region with the maximum weight, comparing the maximum weight of the separation region and the normative weight, deciding on the bimodal type of the initial brightness histogram when the first of these values exceeds the second, finding the global minimum of the approximating polynomial by luminance interval with the maximum weight of the separation region, which is taken as a threshold brightness level to provide perazim original image cutoff threshold managing binary mask, and after the elimination of processing informative fragments insignificant equalizes fragments obtained under the normal distribution, and their high-pass filtering is carried out.