KR100206338B1

KR100206338B1 - Binary coding method of letter image having both character and picture

Info

Publication number: KR100206338B1
Application number: KR1019970004640A
Authority: KR
Inventors: 김지형
Original assignee: 윤종용; 삼성전자주식회사
Priority date: 1997-02-17
Filing date: 1997-02-17
Publication date: 1999-07-01
Also published as: KR19980068158A

Abstract

가. 청구범위에 기재된 발명이 속한 기술분야end. The technical field to which the invention described in the claims belongs

화상입력장치에서 문자와 그림이 혼재된 문서를 스캔하여 얻어진 화상을 이치화하는 방법에 관한 것이다.The present invention relates to a method of binarizing an image obtained by scanning a document in which characters and pictures are mixed in an image input apparatus.

나. 발명이 해결하고자 하는 기술적 과제I. The technical problem to be solved by the invention

노이즈 성분이 존재하는 문자와 그림이 혼재된 화상에 대하여도 화질 및 압축율을 향상시킬 수 있는 이치화방법을 제공한다.Provided is a binarization method capable of improving image quality and compression ratio even for an image in which characters and pictures with noise components are mixed.

다. 발명의 해결방법의 요지All. Summary of Solution of the Invention

스캔 화상의 주목화소에 대한 에지의 크기 및 방향성을 검출하고, 에지의 크기를 미리 설정된 역치와 비교하여 주목화소가 문자의 경계부분인 경우 방향성에 따른 에지의 연속성 여부를 조사하며, 에지의 연속성이 있는 경우에는 주목화소를 같은 방향의 이전 화소에 대한 이치화값과 동일한 값으로 이치화하고, 에지의 연속성이 없는 경우에는 국부 역치를 적용하여 주목화소를 이치화한다.Detects the size and orientation of the edge of the pixel of interest in the scanned image, compares the size of the edge with a preset threshold, and examines the continuity of the edge according to the orientation when the pixel of interest is the boundary of the character. If present, the pixel of interest is binarized to the same value as the binarization value for the previous pixel in the same direction. If there is no edge continuity, the pixel of interest is binarized by applying a local threshold.

라. 발명의 중요한 용도la. Important uses of the invention

문자와 그림이 혼재된 문서화상을 이치화하는데 이용한다.It is used to binarize document images with mixed text and pictures.

Description

How to binarize text and pictures

본 발명은 화상입력장치에 관한 것으로, 특히 문자와 그림이 혼재된 문서를 스캔(scan)하여 얻어진 화상을 이치화하는 방법에 관한 것이다.BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image input device, and more particularly, to a method for binarizing an image obtained by scanning a document in which characters and pictures are mixed.

통상적으로 팩시밀리(facsimile), 스캐너(scanner), 디지털 복사기등과 같은 화상입력장치는 문서를 스캔하여 얻어진 아나로그 화상신호를 디지털 화상데이터로 변환한후 처리한다. 이때 아나로그 화상신호는 화소마다 원래의 문서에 대응하는 계조를 가지는 디지털 화상데이터로 변환되는데, 계조를 가지게 되므로 데이터량이 아주 많게 된다. 예를들어 256단계의 계조를 가지는 경우 각 화소는 8비트로 표현된다. 이에따라 통상적으로 스캔에 따른 화상데이터를 이치화하므로써 데이터량을 대폭적으로 줄인 다음에 부호화한후 송신하거나 복사 문서를 출력한다. 상기 이치화는 계조를 가지는 각 화소를 백화소 또는 흑화소로 결정하는 것으로, 백화소와 흑화소는 상대적으로 1 또는 0의 1비트 데이터로 표현된다.Typically, an image input apparatus such as a facsimile, scanner, digital copier, or the like converts an analog image signal obtained by scanning a document into digital image data and processes it. At this time, the analog image signal is converted to digital image data having a gradation corresponding to the original document for each pixel. Since the gradation has a gradation, the amount of data becomes very large. For example, in the case of 256 gray levels, each pixel is represented by 8 bits. Accordingly, by binarizing the image data according to the scan, the amount of data is drastically reduced and then encoded and transmitted or a copy document is output. The binarization determines each pixel having a gray level as a white pixel or a black pixel, and the white pixel and the black pixel are represented by one bit data of 1 or 0 relatively.

도 1은 상기한 화상입력장치의 예로서 통상적인 스캐너의 블록구성도를 보인 것이다. 중앙처리장치(100)는 원고감지센서(112)에 의해 원고 장착을 감지하고 조작 패널(operational panel)(110)로부터 스캔 시작명령이 입력될 때 이미지 프로세서(104)를 제어하여 원고를 스캔한다. 스캔 유니트(102)는 원고를 스캔하기 위한 광학계를 포함하며 원고를 스캔하여 아나로그 화상신호를 발생한다. 이미지 프로세서(104)는 스캔 유니트(102)를 구동하며 스캔 유니트(102)로부터 출력되는 화상신호를 디지털 데이터로 변환하고 이미지 처리 및 이치화한다. 롬(ROM: Read Only Memory)(106)에는 중앙처리장치(100)의 동작 프로그램과 각종 참조데이터가 미리 저장된다. 램(RAM: Random Access Memory)(108)은 중앙처리장치(100)의 동작 수행에 따른 데이터가 일시 저장된다. 조작패널(110)은 중앙처리장치(100)에 사용자에 의한 키입력을 제공한다. 원고감지센서(112)는 원고가 장착되는 것을 감지하여 중앙처리장치(100)에 알린다.1 shows a block diagram of a conventional scanner as an example of the image input apparatus. The central processing unit 100 detects the document mounting by the document detection sensor 112 and controls the image processor 104 to scan the document when a scan start command is input from the operation panel 110. The scanning unit 102 includes an optical system for scanning an original and scans the original to generate an analog image signal. The image processor 104 drives the scan unit 102 to convert the image signal output from the scan unit 102 into digital data, and to process and binarize the image. The ROM (Read Only Memory) 106 stores the operation program of the CPU 100 and various reference data in advance. The random access memory (RAM) 108 temporarily stores data according to an operation of the CPU 100. The operation panel 110 provides a key input by the user to the CPU 100. The document detection sensor 112 detects that the document is mounted and notifies the central processing unit 100.

상기한 스캐너와 같은 화상입력장치에서 이치화는 문서화상의 종류에 따라 각각 다른 방법을 채용한다. 상기 문서화상의 종류는 문자로 이루어진 문자 문서, 그림(또는 사진)으로 이루어진 그림 문서, 그리고 문자와 그림이 혼재된 문서로 구분되어진다.In an image input apparatus such as the above-described scanner, binarization adopts a different method depending on the type of document image. The document image is classified into a text document consisting of text, a picture document consisting of a picture (or a picture), and a document in which text and pictures are mixed.

이들중 문자와 그림이 혼재된 화상을 이치화하는 종래의 방법에 따른 이미지 프로세서(104)의 흐름도를 도 2로서 도시하였다. 먼저 이미지 프로세서(104)는 도 2의 (200)∼(204)단계에서 현재 처리할 화소인 주목화소의 주변화소값 중에서 최대값과 최소값을 구하여 이 값의 차, 즉 국부밝기차를 결정하고 국부밝기차를 미리 설정된 역치(threshold) Tdif와 비교한다. 이때 국부밝기차가 역치 Tdif보다 크면 이미지 프로세서(104)는 주목화소가 문자의 경계인 것으로 인식하여 (206)∼(210)단계에서 국부 역치(local threshold)를 적용하여 이치화한다.2 is a flowchart of an image processor 104 according to a conventional method of binarizing images in which characters and pictures are mixed. First, the image processor 104 determines the difference between these values, that is, the local brightness difference, by obtaining the maximum value and the minimum value among the peripheral pixel values of the pixel of interest, which are the pixels to be processed, in steps (200) to (204) of FIG. The brightness difference is compared with a preset threshold Tdif. At this time, if the local brightness difference is larger than the threshold Tdif, the image processor 104 recognizes that the pixel of interest is the boundary of the character and binarizes it by applying a local threshold in steps 206 to 210.

이와 달리 국부밝기차가 역치 Tdif보다 작으면 이미지 프로세서(104)는 (212)∼(228)단계에서 주목화소가 문자내부나 배경 또는 그림부분인 것으로 인식하여 각 경우마다 개별적으로 처리한다. 즉, 이미지 프로세서(104)는 (212)단계에서 주목화소값이 미리 설정된 최대역치값 Tmax보다 크면 배경인 것으로 인식하여 (214)단계에서 주목화소를 백으로 이치화한다. 이와 달리 (216)단계에서 주목화소값이 미리 설정된 최소역치값 Tmin보다 작으면 이미지 프로세서(104)는 주목화소가 문자내부인 것으로 인식하여 (218)단계에서 주목화소를 흑으로 이치화한다.On the other hand, if the local brightness difference is smaller than the threshold Tdif, the image processor 104 recognizes that the pixel of interest is the inside of the character, the background, or the picture part in steps 212 to 228, and processes each case separately. That is, the image processor 104 recognizes that the pixel of interest is a background when the pixel value of interest is greater than the preset maximum threshold value Tmax in step 212 and binarizes the pixel of interest to white in step 214. On the contrary, if the pixel value of interest is smaller than the preset minimum threshold value Tmin in step 216, the image processor 104 recognizes that the pixel of interest is inside the character and binarizes the pixel of interest in black in step 218.

만일 상기한 어떤 경우에도 해당하지 않으면 이미지 프로세서(104)는 주목화소가 일단 그림부분인 것으로 인식하고 노이즈 성분인지 여부를 확인하기 위해 (220)단계에서 그림의 연속성이 있는가를 검사한다. 이때 그림영역에 존재하는 화소는 독립적으로 존재할 수 없다는 전제 아래 영역의 연속성을 조사한다. 이와 같이 연속성을 조사하는 방법으로는 도 3에 보인 윈도우를 사용한다. 상기 도 3에 보인 윈도우는 i번째 라인의 j열의 화소 x를 주목화소라 할 때, 빗금친 주변화소들, 즉 인접한 i-1번째 라인의 j-1열, j열, j+1열 각각의 화소와 i번째 라인의 j-1열의 화소들이 그림영역에 존재한다고 할 때, 주목화소 역시 그림영역에 속한다고 판단한다. 이때 그림의 연속성이 존재하면 이미지 프로세서(104)는 (222)단계에서 디더링(dithering)이나 오차확산등과 같은 중간조표현을 적용하여 주목화소를 이치화한다. 그리고 그림의 연속성이 없으면 이미지 프로세서(104)는 (224)∼(228)단계에서 전역 역치(global threshold)를 적용하여 이치화한다. 상기 (224)단계에서 이미지 프로세서(104)는 주목화소값을 전역 역치와 비교하여, 주목화소값이 전역 역치보다 크면 주목화소를 백화소로 이치화하고 주목화소값이 전역 역치보다 작으면 주목화소를 흑화소로 이치화한다.If none of the above is true, the image processor 104 recognizes that the pixel of interest is once a picture portion and checks whether there is continuity of the picture in step 220 to determine whether it is a noise component. At this time, the continuity of the area is examined under the premise that the pixels in the picture area cannot exist independently. Thus, the window shown in FIG. 3 is used as a method of examining continuity. In the window shown in FIG. 3, when the pixel x in the j column of the i-th line is the pixel of interest, the shaded peripheral pixels, that is, the j-1, j, j + 1 columns of the adjacent i-1th line, respectively When pixels and pixels in column j-1 of the i-th line exist in the picture area, it is determined that the pixel of interest also belongs to the picture area. If the continuity of the picture exists, the image processor 104 binarizes the pixel of interest in step 222 by applying an intermediate tone expression such as dithering or error diffusion. If there is no continuity of the picture, the image processor 104 binarizes by applying a global threshold in steps 224 to 228. In step 224, the image processor 104 compares the pixel value of the pixel with the global threshold value, binarizes the pixel of interest with the white pixel if the pixel value of interest is greater than the global threshold, and blackens the pixel of interest if the pixel value of interest is smaller than the global threshold. Reason with cows.

상기한 이치화방법에서 주목화소가 문자의 에지, 즉 문자의 경계부분인지에 대한 판정기준에 관한 부분이 문제가 된다. 즉, 주변화소의 최대값과 최소값의 차이로써 에지를 검출하기 때문에 실제 문자의 에지가 아닌 부분, 즉 노이즈 성분까지 추출하며, 추출된 문자의 에지라고 해도 직선화가 되지 못하는 단점이 있다. 따라서 직선화되지 못한 에지와 노이즈 성분의 에지로 인하여 런장(run-length) 부호화와 같은 데이터 압축을 할 때 압축율이 떨어지므로 데이터 전송시 전송효율이 떨어지고 화질 역시 저하된다.In the binarization method described above, a part regarding the criterion for determining whether the pixel of interest is the edge of the character, that is, the boundary of the character, is problematic. That is, since the edge is detected by the difference between the maximum value and the minimum value of the neighboring pixels, a portion which is not an edge of the actual character, that is, a noise component is extracted, and even though the edge of the extracted character is not straightened. Therefore, due to the non-linearized edges and the edges of noise components, the compression rate decreases when performing data compression such as run-length coding. Therefore, the transmission efficiency decreases and the image quality deteriorates.

상술한 바와 같이 종래의 방법은 에지의 방향성 및 연속성을 고려하지 않고 에지를 검출하여 이치화함에 따라 노이즈 성분이 존재하는 화상에 대하여는 화질이 저하될 뿐만아니라 에지가 직선화되지 못하므로 압축율이 떨어지는 단점이 있었다.As described above, the conventional method has a disadvantage in that the image quality is not only degraded for an image in which noise components exist but also the edges are not linearized as the edge is detected and binarized without considering the directionality and continuity of the edge. .

따라서 본 발명의 목적은 노이즈 성분이 존재하는 문자와 그림이 혼재된 화상에 대하여도 화질 및 압축율을 향상시킬 수 있는 이치화방법을 제공함에 있다.Accordingly, an object of the present invention is to provide a binarization method capable of improving image quality and compression ratio even for an image in which a character and a picture having noise components are mixed.

도 1은 통상적인 스캐너의 블록구성도,1 is a block diagram of a conventional scanner;

도 2는 종래의 처리 흐름도,2 is a conventional processing flow chart,

도 3은 도 2에서 그림영역의 연속성을 판단하는 방법을 보인 예시도,3 is an exemplary view illustrating a method of determining continuity of a picture area in FIG. 2;

도 4는 본 발명의 실시예에 따른 처리 흐름도,4 is a process flow diagram according to an embodiment of the present invention;

도 5는 통상적인 에지의 크기 및 방향성을 검출하는 윈도우 예시도,5 is an exemplary window for detecting the size and direction of a typical edge;

도 6은 본 발명에서 방향성에 따른 연속성을 조사하는 방법을 보인 예시도,Figure 6 is an exemplary view showing a method for investigating continuity according to the direction in the present invention,

도 7은 본 발명에서 국부 역치를 결정하는 방법을 보인 예시도.7 is an exemplary view illustrating a method of determining local thresholds in the present invention.

이하 본 발명의 바람직한 실시예를 첨부한 도면을 참조하여 상세히 설명한다. 하기 설명 및 첨부 도면에서 구체적인 처리 흐름과 같은 많은 특정 상세들이 본 발명의 보다 전반적인 이해를 제공하기 위해 나타나 있다. 이들 특정 상세들없이 본 발명이 실시될 수 있다는 것은 이 기술분야에서 통상의 지식을 가진 자에게 자명할 것이다. 그리고 본 발명의 요지를 불필요하게 흐릴 수 있는 공지 기능 및 구성에 대한 상세한 설명은 생략한다.Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings. Many specific details are set forth in the following description and in the accompanying drawings to provide a more general understanding of the invention. It will be apparent to those skilled in the art that the present invention may be practiced without these specific details. And a detailed description of known functions and configurations that may unnecessarily obscure the subject matter of the present invention will be omitted.

우선 본 발명은 전술한 도 1에 보인 스캐너와 같은 화상입력장치에 있어서 기존 방법의 문제점을 해결하기 위해 에지의 방향성 및 연속성을 고려하여 에지부분을 이치화한다. 즉, 에지의 연속성이 존재할 경우 이치화 결과가 직선화되도록 한다. 그리고 에지가 존재하지 않는 부분에 대해서는 종래 기술의 처리방법을 그대로 적용한다.First, the present invention binarizes an edge part in consideration of the directionality and continuity of the edge in order to solve the problem of the conventional method in the image input apparatus such as the scanner shown in FIG. In other words, if the continuity of the edge is present, the binarization result is straightened. In the case where the edge does not exist, the prior art processing method is applied as it is.

도 4는 이를 위한 본 발명의 실시예에 따른 처리 흐름도로서, 편의상 1화소를 이치화하는 예를 보인 것으로, 모든 화소들에 대해 동일하게 적용된다. 이러한 도 4의 흐름도에 따른 도 1의 이미지 프로세서(104)에 의해 수행되도록 프로그램한다.4 is a flowchart illustrating a process of binarizing one pixel for convenience, according to an exemplary embodiment of the present invention. The same applies to all pixels. It is programmed to be performed by the image processor 104 of FIG. 1 according to the flow chart of FIG. 4.

상기와 같은 상태에서 원고감지센서(112)에 의해 원고 장착이 감지되고 조작 패널(110)로부터 스캔 시작명령이 입력됨에 따라 중앙처리장치(100)의 제어에 의해 이미지 프로세서(104)가 스캔 유니트(102)에 의해 원고를 스캔하기 시작하면, 스캔 유니트(102)로부터 원고 스캔에 따른 화상신호가 출력된다. 그러면 이미지 프로세서(104)는 스캔 유니트(102)로부터 출력되는 화상신호를 디지털 화상데이터로 변환한후, 스캔하는 라인단위로 화소값을 램(108)에 저장하고 도 4의 흐름도에 따른 이치화를 수행한다. 이하의 설명에서 스캔 화상을 저장하는 라인수는 3라인을 기본값으로 하는 것으로 가정한다.As the document is detected by the document detection sensor 112 and the scan start command is input from the operation panel 110 in the above state, the image processor 104 is controlled by the central processing unit 100 under the control of the scanning unit ( When the original is scanned by 102, an image signal corresponding to the original scan is output from the scanning unit 102. Then, the image processor 104 converts the image signal output from the scan unit 102 into digital image data, stores the pixel value in the RAM 108 in units of lines to be scanned, and performs binarization according to the flowchart of FIG. 4. do. In the following description, it is assumed that the number of lines for storing the scanned image is based on three lines as a default.

먼저 이미지 프로세서(104)는 도 4의 (300)단계에서 이치화할 주목화소를 지정한후, (302)단계에서 에지의 크기 및 방향성을 검출한다. 이때 도 5(a)∼도 5(d)에 보인 바와 같이 방향성을 결정할 수 있는 4개의 윈도우를 사용하여 에지의 크기 및 방향성을 검출한다. 상기 도 5(a)는 0˚방향의 윈도우를 보인 것이고, 도 5(b)는 45˚방향의 윈도우를 보인 것이며, 도 5(c)는 90˚방향의 윈도우를 보인 것이며, 도 5(d)는 135˚방향의 윈도우를 보인 것이다. 상기 도 5(a)∼도 5(d)의 윈도우는 3개의 라인, 즉 i번째 라인의 j열의 화소를 주목화소라 할 때 i-1번째 라인과 i번째 라인과 i+1번째 라인 각각의 j-1열, j열, j+1열의 화소에 대해 해당하는 가중치를 주는 것으로, 이러한 에지검출방법은 기존에 이미 공지된 Kirsch방법을 보인 것이다. 그러므로 이에 대한 상세한 설명은 생략한다.First, the image processor 104 designates the pixel of interest to be binarized in operation 300 of FIG. 4, and then detects the size and direction of the edge in operation 302. At this time, as shown in Figs. 5 (a) to 5 (d), the size and the direction of the edge are detected using four windows that can determine the direction. FIG. 5 (a) shows a window in the direction of 0 °, FIG. 5 (b) shows a window in the direction of 45 °, and FIG. 5 (c) shows a window in the direction of 90 °, and FIG. 5 (d). ) Shows the window at 135˚. 5 (a) to 5 (d) show three pixels, i.e., i-th line, i-th line and i + 1th line, respectively, By giving the corresponding weights to the pixels of the j-1 column, the j column, and the j + 1 column, the edge detection method is a known Kirsch method. Therefore, detailed description thereof will be omitted.

상기와 같이 4가지의 윈도우를 사용하는 에지검출방법에 의해 에지의 크기 및 방향성을 검출한후 이미지 프로세서(104)는 (304)단계에서 에지의 크기, 즉 4개의 윈도우를 사용하여 얻은 4개의 크기값 중에서 가장 큰 값을 미리 설정된 역치와 비교한다. 이때 에지의 크기가 역치보다 크면 주목화소가 문자의 경계부분임을 의미하고, 역치보다 작으면 주목화소가 문자의 경계부분이 아니고 문자 내부나 배경 또는 그림임을 의미한다.After detecting the size and orientation of the edge by the edge detection method using the four windows as described above, the image processor 104 determines the size of the edge, that is, four sizes obtained by using four windows in step 304. The largest of the values is compared with a preset threshold. At this time, if the size of the edge is larger than the threshold, the pixel of interest is the boundary of the text. If the edge is smaller than the threshold, the pixel of interest is not the boundary of the text, but the interior, background, or picture.

이때 만일 에지의 크기가 역치보다 작으면, 주목화소가 문자의 경계부분이 아니고 문자 내부나 배경 또는 그림이므로 이미지 프로세서(104)는 (318)∼(334)단계를 수행한다. 상기 (318)∼(334)단계는 전술한 도 2의 (212)∼(228)단계와 동일하다. 그러므로 이에 대한 설명은 생략한다.At this time, if the size of the edge is smaller than the threshold value, the image processor 104 performs steps 318 to 334 because the pixel of interest is not the boundary of the letter but the inside or the background or the picture. Steps 318 to 334 are the same as steps 212 to 228 of FIG. 2 described above. Therefore, description thereof is omitted.

이와달리 상기 (304)단계에서 에지의 크기가 역치보다 크면, 주목화소가 일단은 문자의 경계부분으로 볼 수 있으나, 노이즈 성분으로 인한 화질 저하나 에지가 직선화되지 못하게 되는 것을 방지하기 위해 (306)∼(316)단계에서 에지의 방향성 및 연속성을 고려하여 에지부분을 이치화한다.On the other hand, if the size of the edge is larger than the threshold in step 304, the pixel of interest may be viewed as the boundary of the character. However, in order to prevent the degradation of the image quality or the straightening of the edge due to the noise component, the pixel may not be straightened (306). In step 316, the edge portion is binarized in consideration of the directionality and continuity of the edge.

상기 (306)단계에서 이미지 프로세서(104)는 상기 (302)단계에서 검출된 에지의 방향성에 따라 이전에 검출된 에지와의 연속성 여부를 결정한다. 이때 전술한 바와 같은 도 5(a)∼5(d)의 윈도우에 의해 0˚, 45˚, 90˚, 135˚중에 한 방향으로 결정된 에지의 방향성에 따라 도 6(a)∼6(d)중에 하나를 적용하여 연속성을 조사한다. 상기 도 6(a)∼6(d)는 에지로 검출된 주목화소를 x라 할 때 빗금친 화소와 연속성을 조사하는 것을 보인 것이다. 즉, 주목화소가 0˚의 방향성을 가지는 에지인 것으로 검출된 경우에는 도 6(a)와 같이 빗금친 부분에 주목화소와 같은 0˚의 에지가 있는가를 조사한다. 이와 마찬가지로 주목화소가 45˚의 방향성을 가지는 에지인 것으로 검출된 경우에는 도 6(b)와 같이 빗금친 부분에 주목화소와 같은 45˚의 에지가 있는가를 조사하고, 주목화소가 90˚의 방향성을 가지는 에지인 것으로 검출된 경우에는 도 6(c)와 같이 빗금친 부분에 주목화소와 같은 90˚의 에지가 있는가를 조사하며, 주목화소가 135˚의 방향성을 가지는 에지인 것으로 검출된 경우에는 도 6(d)와 같이 빗금친 부분에 주목화소와 같은 135˚의 에지가 있는가를 조사한다. 이와 같이 연속성을 조사하므로써 에지의 직선화를 이룰 수 있게 된다.In step 306, the image processor 104 determines whether or not the continuity with the previously detected edge is based on the direction of the edge detected in step 302. At this time, according to the directionality of the edge determined in one direction among 0 °, 45 °, 90 °, and 135 ° by the windows of FIGS. 5 (a) to 5 (d) as described above, FIGS. Examine the continuity by applying one of them. 6 (a) to 6 (d) show that the pixel of interest detected as an edge is examined for continuity with the shaded pixel. That is, when it is detected that the pixel of interest is an edge having a directionality of 0 °, it is checked whether there is an edge of 0 ° like the pixel of interest in the hatched portion as shown in Fig. 6A. Similarly, when the pixel of interest is detected to be an edge having a directionality of 45 °, it is checked whether the pixel of interest has an edge of 45 ° like the pixel of interest as shown in FIG. If it is detected that the branch has an edge of 90 degrees, such as the pixel of interest, in the hatched portion as shown in Fig. 6 (c), and if it is detected that the pixel of interest has an directional edge of 135 °, Fig. 6 Investigate whether there are 135 ° edges, such as the pixel of interest, in the hatched areas as shown in (d). By examining the continuity in this way, the edge can be straightened.

상기 (306)단계에서 연속성을 조사한후, 이미지 프로세서(104)는 (308)단계에서 조사 결과가 연속성이 있는가를 검사한다. 이때 연속성이 있다면 주목화소를 같은 방향의 이전 화소와 동일하게 이치화하면 에지가 직선화될 것이다. 이와 같이 에지의 직선화를 위해 이미지 프로세서(104)는 (310)단계에서 주목화소를 같은 방향의 이전 화소와 동일한 값으로 이치화한다. 이때 이전 화소는 주목화소의 방향에 따라 상기한 도 6(a)∼6(d)중에 하나에 있는 빗금친 화소가 된다.After examining the continuity in step 306, the image processor 104 checks whether the survey results are continuity in step 308. If there is continuity, the edge will be straightened if the pixel of interest is binarized with the previous pixel in the same direction. In order to straighten the edges, the image processor 104 binarizes the pixel of interest to the same value as the previous pixel in the same direction in step 310. At this time, the previous pixel becomes the hatched pixel in one of Figs. 6 (a) to 6 (d) according to the direction of the pixel of interest.

이와달리 상기 (308)단계에서 연속성이 없다면 주목화소에 대해 이전 화소를 고려할 필요가 없다. 이에따라 이미지 프로세서(104)는 (312)∼(314)단계에서 전술한 도 2의 (206)∼(210)단계에서와 동일하게 국부 역치를 적용하여 이치화한다. 즉, 이미지 프로세서(104)는 (218)단계에서 주목화소값을 국부 역치와 비교하여, 국부 역치보다 크면 (220)단계에서 주목화소를 백으로 이치화하고 국부 역치보다 작으면 (222)단계에서 주목화소를 흑으로 이치화한다.On the other hand, if there is no continuity in step 308, it is not necessary to consider the previous pixel for the pixel of interest. Accordingly, the image processor 104 applies and localizes the local threshold in the same manner as in the steps 206 to 210 of FIG. 2 described above in the steps 312 to 314. That is, the image processor 104 compares the pixel value of interest with the local threshold in step 218, binarizes the pixel of interest to white in step 220 if it is greater than the local threshold, and in step 222 if it is smaller than the local threshold. The pixels are binarized to black.

여기서 국부 역치를 결정하는 방법을 예를들어 보이면 도 7(a)∼7(d)와 같다. 도 75(a)는 주목화소의 에지가 0˚의 방향성을 가지는 경우, 빗금친 부분의 평균값을 국부역치로 설정하는 것을 보이고 있다. 이와 마찬가지로 도 7(b), 7(c), 7(d)는 각각 주목화소의 에지가 45˚, 90˚, 135˚의 방향성을 가지는 경우, 빗금친 부분의 평균값을 국부역치로 설정하는 것을 보인다. 이때 특별히 ▩로 표시한 부분은 가중치를 부여할 수도 있다.Herein, a method of determining the local threshold is shown in Figs. 7 (a) to 7 (d). Fig. 75 (a) shows that when the edge of the pixel of interest has a direction of 0 degrees, the average value of the hatched portion is set to the local threshold. Similarly, Figs. 7 (b), 7 (c), and 7 (d) show that when the edge of the pixel of interest has directionalities of 45 °, 90 °, and 135 °, the average value of the hatched portion is set to the local threshold. see. At this time, the portion marked with ▩ may be assigned a weight.

상술한 하나의 화소에 대한 이치화과정을 종료하면 리턴하여 전체 화상에 대한 이치화를 완료할때까지 다시 다음의 화소를 주목화소로 지정하고 이치화하는 동일한 동작이 반복적으로 이루어진다.When the above-mentioned binarization process for one pixel is finished, the same operation is repeated until the next pixel is designated and binarized again until the binarization of the entire image is completed.

따라서 에지의 방향성 및 연속성을 고려하여 이치화하므로 에지의 직선화가 이루어질 수 있으며 노이즈로 인한 고립점 생성을 방지할 수 있다. 그러므로 노이즈 성분이 존재하는 화상에 대하여도 실제 문자의 에지만을 우수하게 추출할 수 있다.Therefore, since the edges are binarized in consideration of the directionality and continuity of the edges, the edges can be straightened and generation of isolation points due to noise can be prevented. Therefore, only the edges of the actual characters can be excellently extracted even for an image having a noise component.

상술한 바와 같이 본 발명은 에지의 연속성이 존재할 경우 이치화 결과가 직선화됨으로써 압축 부호화할 때 압축율을 높일 수 있으며, 노이즈 성분이 존재하는 화상에 대하여도 실제 문자의 에지만을 우수하게 추출함으로써 화질을 향상시킬 수 있는 잇점이 있다.As described above, the present invention can improve the compression ratio when compressing and encoding the binarization result when the edge continuity exists, and improve the image quality by extracting only the edge of the actual character excellently even for the image having the noise component. There is an advantage to this.

Claims

In the method for binarizing an image obtained by scanning a document in which characters and pictures are mixed,

Detecting a size and an orientation of an edge of a pixel of interest of the scanned image;

Comparing the size of the edge with a preset threshold to determine whether the pixel of interest is a boundary of a character;

If it is detected that the pixel of interest is a boundary of the character, checking whether the edge according to the directionality is continuous;

Binning the pixel of interest to the same value as the binarization value for the previous pixel in the same direction when the edge is continuous;

Binning the pixel of interest by applying a local threshold when there is no continuity of the edge;

Binarizing the pixel of interest to a white pixel if it is detected that the pixel of interest is not a boundary of a character but a background;

Binarizing the pixel of interest to black pixels if the pixel of interest is detected as being inside the character rather than the boundary or background of the character;

Checking whether there is a continuity of the picture when the pixel of interest is detected as a picture rather than a boundary or background of the letter or the inside of the letter;

Binarizing the pixel of interest by halftone expression when there is continuity of the picture;

And binarizing the pixel of interest by applying a global threshold if there is no continuity of the picture.

The binarization method according to claim 1, wherein the detecting of the size and the directionality detects the size and the directionality in four directions of 0 °, 45 °, 90 °, and 135 ° with respect to the pixel of interest.

2. The method of claim 1, wherein the continuity checking process checks the previous pixel in the same direction as the directionality of the pixel of interest, and detects that there is continuity if the previous pixel is the same directional edge, otherwise there is no continuity. Binarization method characterized by the above-mentioned.