KR100206825B1

KR100206825B1 - The method of processing document to image

Info

Publication number: KR100206825B1
Application number: KR1019960066639A
Authority: KR
Inventors: 이현주
Original assignee: 구자홍; 엘지전자주식회사
Priority date: 1996-12-17
Filing date: 1996-12-17
Publication date: 1999-07-01
Also published as: KR19980048098A

Abstract

본 발명은 문서 영상화 방법에 관한 것으로, 종래에는 영문자의 경우와 하나의 문장열(text line) 안에서는 중심이나 바닥 점들이 문장 열의 기준선과 일치한다는 전제하에 문서 영상의 기울기를 측정하는 방법을 이용하여 문서의 영상화가 가능하였으나 한글이나 일본어 등 한자가 여러 개의 구성요소로 이루어져 있는 문자에 대해서는 이러한 방법을 적용할 수 없는 문제점이 있다. 따라서 본 발명은 입력된 문단에 대하여 저장된 폭 만큼 수평화소 접합을 수행하여 문자열이 하나의 화소가 되도록 연결하는 제1단계와; 상기 제1단계에서 연결된 문자열의 외곽선을 추출하는 제2단계와; 상기 제2단계에서 추출한 외곽선중 제일 아래에 위치하는 바닥열 화소들만을 추출하는 제3단계와; 상기 제3단계에서 추출한 화소들을 최소의 오차를 가지고 연결하는 직선을 찾아 기울기를 계산하는 제4단계를 통해 얻어지는 기울기를 이용하여 영상의 영역 분할, 문장 줄 분리, 인식 등의 과정에서 사용하고, 또한 상기과정을 통해 계산되는 각도를 이용하여 문서 영상을 회전시켜 저장하면 불필요한 부분의 영상을 제거할 수 있으므로 저장 공간을 줄일 수 있고, 사용자가 기울어진 영상을 볼 때 발생하는 불편함을 제거한다.BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document imaging method. Conventionally, a document using a method of measuring a tilt of a document image on the premise that a center or a bottom point coincides with a reference line of a sentence string in the case of an English letter and a sentence line in a text line. Although imaging is possible, there is a problem that this method cannot be applied to a character composed of several components such as Korean or Japanese. Accordingly, the present invention provides a first step of performing horizontal pixel bonding on the input paragraph by the stored width to connect the character strings into one pixel; A second step of extracting an outline of the string connected in the first step; A third step of extracting only bottom row pixels positioned at the bottom of the outline extracted in the second step; It is used in the process of region division, sentence line separation, recognition, etc. by using the slope obtained through the fourth step of calculating a slope by finding a straight line connecting the pixels extracted in the third step with the minimum error. By rotating and storing the document image by using the angle calculated through the above process, the unnecessary portion of the image can be removed, thereby reducing the storage space and eliminating the inconvenience caused when the user views the tilted image.

Description

Document imaging method

본 발명은 스캐너 또는 카메라 등을 이용하여 문서를 영상화 할 때 발생하는 기울어짐 각을 이미지 처리를 통해 계산하기 위한 것으로, 특히 문서를 영상화 할 때 기울어짐 각도를 이미지의 처리를 통해서 계산하고 이 계산된 각도를 이용하여 문서 영상을 회전시켜 저장하고 불필요한 부분의 영상을 제거하여 저장공간을 줄일 수 있도록 한 문서 영상화 방법에 관한 것이다.The present invention is to calculate the inclination angle generated when the image of the document using a scanner or a camera through the image processing, in particular when calculating the inclination angle when imaging the document through the processing of the image and calculated The present invention relates to a document imaging method in which a document image is rotated and stored using an angle, and an unnecessary portion of the image is removed to reduce a storage space.

종래에는 문서를 영상화 할 경우, 입력되는 문서의 잘못된 위치, 스캐너 롤러의 부정확한 컨트롤 등으로 인해 발생하는 기울어짐은 문서 영상을 처리하고 인식하는데 있어 많은 문제점을 발생시킨다.Conventionally, when imaging a document, inclination caused by an incorrect position of an input document, inaccurate control of a scanner roller, or the like causes many problems in processing and recognizing a document image.

따라서 문서 영상의 기울기를 측정하는 방법을 이용하여 문서를 영상화하였다.Therefore, the document was imaged using a method of measuring the tilt of the document image.

상기에서 문서 영상의 기울기를 측정하는 방법은, 각 문장 열로 부터 연결 화소(connected component)를 추출하고 이들의 중심점 또는 윤곽선의 바닥 점들을 가장 근사하게 연결하는 직선을 찾는 방법이다.The method of measuring the inclination of the document image is a method of extracting a connected component from each sentence column and finding a straight line that most closely connects the center points or the bottom points of the contour lines.

그러나, 상기와 같은 종래의 기술은 제1도에서와 같은 영문자의 경우와 하나의 문장 열(text line) 안에서는 중심이나 바닥 점들이 문장 열의 기준선과 일치한다는 전제하에 개발되었으나 제2도에서와 같은 한글이나 일본어 등 한자가 여러 개의 구성요소로 이루어져 있는 문자에 대해서는 이러한 방법을 적용할 수 없는 문제점이 있다.However, the conventional technique as described above has been developed under the premise that the center or bottom points coincide with the reference line of the sentence string in the case of an English letter as in FIG. 1 and in a text line. There is a problem that this method cannot be applied to a character composed of several components such as Chinese or Japanese.

제1도는 영어 문장 열과 바닥 화소들을 보여주는 설명도.1 is an explanatory diagram showing English sentence rows and bottom pixels.

제2도는 한글 문장 열과 바닥 화소들을 보여주는 설명도.2 is an explanatory diagram showing a Hangul sentence column and bottom pixels.

제3도는 본 발명의 문서 영상화 방법에 대한 동작 과정도.3 is a flowchart illustrating an operation of the document imaging method of the present invention.

제4도는 입력되는 한글 문단의 한 예를 보여주는 예시도.4 is an exemplary view showing an example of an input of Hangul paragraph.

제5도는 수평 화소 접합을 거친 한글 문단의 한 예를 보여주는 예시도.5 is an exemplary diagram showing an example of a Hangul paragraph through a horizontal pixel junction.

제6도는 제5도에서, 외곽선 추출한 상태를 보여주는 추출도.6 is an extraction diagram showing a state in which the outline extraction in FIG.

제7도는 제6도에서, 바닥 화소 추출을 보여주는 추출도.FIG. 7 is an extraction diagram showing bottom pixel extraction in FIG.

따라서, 상기에서와 같은 문제점을 해결하기 위한 본 발명 문서 영상화 방법은, 입력된 문단에 대하여 저장된 폭 만큼 수평화소 접합을 수행하여 문자열이 하나의 화소가 되도록 연결하는 제1단계와; 상기 제1단계에서 연결된 문자열의 외곽선을 추출하는 제2단계와; 상기 제2단계에서 추출한 외곽선중 제일 아래에 위치하는 화소들만을 추출하는 제3단계와; 상기 제3단계에서 추출한 화소들을 최소의 오차를 가지고 연결하는 직선을 찾아 기울기를 계산하는 제4단계로 이루어진다.Accordingly, the document imaging method of the present invention for solving the above problems comprises: a first step of performing horizontal pixel conjugation with a stored width on an input paragraph so as to concatenate a string into one pixel; A second step of extracting an outline of the string connected in the first step; A third step of extracting only pixels located at the bottom of the outline extracted in the second step; A fourth step of calculating a slope by finding a straight line connecting the pixels extracted in the third step with the minimum error.

상기 각 단계로 이루어진 방법을 수행하기 위한 본 발명의 동작 및 작용효과에 대하여 상세히 설명하면 다음과 같다.Referring to the operation and effect of the present invention for performing the method consisting of the above steps in detail as follows.

제4도에서와 같이 목표달성을 할 수 있도록 ..............고객에게 최와 같은 한글문단이 입력되면 이 한글 문단에 대하여 미리 지정된 폭만큼 수평화소 접합을 수행하여 문자열이 하나의 화소가 되도록 연결한다.As shown in Fig. 4, in order to achieve the goal, when the Korean paragraph like Choi is inputted to the customer, horizontal pixel joining is performed for the specified Hangul paragraph. Concatenate the strings into one pixel.

상기에서 수평화소 접합은 영상에서 두 검은 점 사이의 수평거리가 역치보다 작은 경우 그 검은 화소 사이의 흰 화소들을 검은 화소로 채우게 된다.In the horizontal pixel bonding, when the horizontal distance between two black points in the image is smaller than the threshold, the white pixels between the black pixels are filled with black pixels.

여기서, 주어지는 역치는 문자 폭의 두 배 이상에 해당하므로, 이 과정에 의해 하나의 문자열은 글자와 글자, 단어와 단어 사이가 연결되고, 대부분의 경우 문자열이 하나의 연결 화소가 된다.Here, the given threshold is more than twice the width of the character, and by this process, one string is connected between letters and letters, words and words, and in most cases, the string is one connection pixel.

상기에서와 같이 수평화소 접합을 수행하여 문자열을 하나의 연결 화소로 만들면 제5도에서와 같이 된다.As described above, when the horizontal pixel bonding is performed to make the character string into one connection pixel, it is as shown in FIG.

이렇게 하나의 연결 화소로된 문자열중에서 하나의 문자열에 대해 먼저, 제6도에서와 같이 문자열의 외곽선을 추출하고, 이 외곽선을 포함하는 제일 작은 사각형을 찾는다. 상기에서 찾은 제일 작은 사각형의 안쪽으로 부터 각 화소열 마다 제일 낮은 위치에 존재하는 외곽선 화소를 찾는다.For one string among the strings of one connection pixel, first, the outline of the string is extracted as shown in FIG. 6, and the smallest rectangle containing the outline is found. From the inside of the smallest rectangle found above, the pixel of the outline located at the lowest position is found for each pixel column.

이와같은 과정을 통하게 되면 제7도에서와 같이 하나의 문자열에 대해 제일 아래에 위치하는 바악 열 화소들만을 추출하게 된다.Through this process, as shown in FIG. 7, only the bottom column pixels located at the bottom of one string are extracted.

상기에서 추출된 바닥 열 화소들을 가장 잘 표현하는 직선을 찾기 위해 least square 방식을 사용하여 직선을 추출한다.A straight line is extracted using a least square method to find a straight line that best represents the extracted bottom column pixels.

즉, 찾고자 하는 직선의 식을 아래와 같이 두자.In other words, let's put the equation of the straight line to find as below.

y = mx + by = mx + b

추출된 화소들을 (x,y)로, 그 수를 n이라고 둘 때, 찾고자 하는 것은 다음을 최소화하는 m과 b이다.When the extracted pixels are (x, y) and the number is n, what we want to find is m and b to minimize the following.

이러한 조건을 만족하는 m과 b는 다음과 같이 주어진다.M and b satisfying these conditions are given by

상기에서와 같은 식을 통해 계산된 기울기를 이용하여 영상의 영역 분할, 문장 줄 분리·인식 등의 과정에서 사용하고, 또한 상기과정을 통해 계산되는 각도를 이용하여 문서 영상을 회전시켜 저장하면 불필요한 부분의 영상을 제거할 수 있으므로 저장 공간을 줄일 수 있고, 사용자가 기울어진 영상을 볼 때 발생하는 불편함을 제거한다.It is used in the process of segmentation of the image, sentence line separation and recognition, etc. by using the slope calculated through the equation as described above, and it is unnecessary when rotating and storing the document image using the angle calculated through the above process. Since the image can be removed, the storage space can be reduced, and the inconvenience caused when the user views the tilted image is eliminated.

상술한 바와 같이, 본 발명은 문서를 영상화 할 때 발생하는 기울어짐 각도를 이미지의 처리를 통해서 계산하고, 이 계산된 기울기로 영상의 영역 분할, 문장 줄 분리, 인식 등의 과정에 사용하고 또한, 계산된 각도를 이용하여 문서 영상을 회전시켜 저장하면 불필요한 부분의 영상을 제거할 수 있으므로 저장 공간을 줄일 수 있고, 사용자가 기울어진 영상을 볼 때 발생하는 불편함을 제거하도록 한 효과가 있다.As described above, the present invention calculates the inclination angle generated when the document is imaged through the processing of the image, and uses the calculated inclination in the process of region division, sentence line separation, recognition, etc. of the image, By rotating and storing the document image by using the calculated angle, the unnecessary portion of the image can be removed, thereby reducing the storage space and removing the inconvenience caused when the user views the tilted image.

Claims

A first step of performing horizontal pixel conjugation on the input paragraph by the stored width and concatenating the strings into one pixel; A second step of extracting an outline of the string connected in the first step; A third step of extracting only bottom row pixels positioned at the bottom of the outline extracted in the second step; And a fourth step of calculating a slope by finding a straight line connecting the pixels extracted in the third step with the minimum error.

The method of claim 1, wherein the horizontal pixel bonding fills the black pixels between the two black pixels when the horizontal distance between the two black points in the image is smaller than the specified threshold.

3. The method of claim 2, wherein the threshold is given at least twice the character width.

The method of claim 1, wherein the extraction of the bottom row pixels comprises: a first process of extracting an outline of a character; A second step of finding the smallest rectangle including the outer mountain extracted in the first step; And a third step of searching for the outline pixels located at the lowest positions of each pixel column while scanning the rectangles found in the second step.