WO2001077998A1

WO2001077998A1 - Automatic character recognition method and device

Info

Publication number: WO2001077998A1
Application number: PCT/EP2001/003905
Authority: WO
Inventors: Martin Hund
Original assignee: Cgk Computer Gesellschaft Konstanz Mbh
Priority date: 2000-04-06
Filing date: 2001-04-05
Publication date: 2001-10-18
Also published as: EP1272970A1; ATE278991T1; DE10017081A1; DE50103991D1; EP1272970B1

Abstract

The invention relates to an automatic character recognition method and device, whereby an image area comprising the characters to be recognized is divided into a multitude of pixels arranged in lines and columns, and each pixel is assigned either to an item of useful information that is to be allocated to the characters to be recognized or to an item of disturbance information that is to be suppressed. In order to improve the distinction of the characters to be recognized from a background, the assignment for each pixel is carried out based on the pixels arranged in relation thereto and located in a surrounding area of a predetermined size.

Description


Method and device for automatic character recognition

The invention relates to a method for automatic character recognition according to the preamble of claim 1 and a device for automatic character recognition according to the preamble of claim 20.

In known methods or devices for automatic character recognition, useful information is generally separated from interference information before the actual identification process. For this purpose, an image area comprising the characters to be recognized is subdivided into a plurality of pixels arranged in rows and columns, and each pixel is assigned either to useful information to be attributed to the characters to be recognized or to interference information to be suppressed. In this way, dark writing is separated from a light background, which in the simplest case is done by binarization. Whether a pixel is to be suppressed or retained, that is to say assigned to the useful information attributable to the characters to be recognized, is decided on the basis of the color of the pixel, with only one pixel being considered in each assignment.
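The per-pixel decision of the known methods can be sketched as follows. This is only a minimal illustration: the grayscale image held as nested lists, the function name `binarize` and the threshold of 128 are assumptions and do not come from the source.

```python
def binarize(gray, threshold=128):
    """Known per-pixel approach: each pixel is judged on its own value only.

    gray: list of rows of brightness values 0..255 (assumed representation).
    Returns 1 for useful information (dark writing) and 0 for interference.
    """
    return [[1 if value < threshold else 0 for value in row] for row in gray]


image = [
    [250, 250, 40],
    [250, 120, 130],
]
result = binarize(image)
```

Because no neighboring pixel is consulted, two pixels with nearly the same value (120 and 130 here) can end up on opposite sides of the decision.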

In the known methods or devices, however, problems arise if the background of the characters to be recognized is printed in color. For example, forms are often printed in color and the area to be filled in is marked with a colored frame. The color used to fill out the forms is usually not defined. A separation of writing and background by means of simple binarization is then not possible. If the image area has, for example, a green background with gray characters, the green pixels of the background must be assigned to the interference information based on their color. For this color filtering to work with forms of differing print quality, a tolerance range that takes account of fluctuations in print quality and paper color must be included in the color filter, so that the range of colors to be filtered is enlarged by this tolerance range. In some cases, however, exactly the same colors result from the mixing of the gray printing with the green background color, so that pixels that belong to the gray characters are inevitably assigned to the background.

For clarification, FIG. 7 shows the original of an image area with characters arranged on a background, it being assumed that the background has a different color than the characters. FIG. 8 shows the result of the known color filtering after binarization. Although not all pixels of the background have been suppressed, the characters are already thinned out considerably.

Another example is shown in FIGS. 9 and 10. FIG. 10 shows the result of the known color filtering after binarization, applied to the image area shown in FIG. 9. Here, too, it is apparent that a clean separation of foreground and background is not possible using the known methods or devices.

EP-A-0 449 380 describes a method for automatic character recognition in which each pixel within an image area is examined to determine whether it is to be assigned to useful information or to interference information. The neighboring pixels surrounding a central pixel are included in the examination.

DE-A-198 28 396 discloses a method for processing image data in which an image area of a form is scanned in a grid-like manner. The hue, saturation and brightness are determined for each pixel. With the help of the fuzzy technique, the pixels are assigned to color classes and a gray value is assigned to the respective pixel.

DE-A-44 45 386 and DE-A-195 17 178 relate to methods for cleaning up background information in electronically scanned images. When assessing whether a picture element is to be assigned useful information or background information, neighboring picture elements are also evaluated.

The object of the present invention is to create a method for automatic character recognition according to the preamble of claim 1 and a device for automatic character recognition according to the preamble of claim 20, by means of which the separation of the characters to be recognized from a background is improved.

This object is achieved by a method for automatic character recognition in which an image area comprising the characters to be recognized is subdivided into a plurality of pixels arranged in rows and columns and each pixel is assigned either to useful information to be attributed to the characters to be recognized or to interference information to be suppressed, characterized in that the assignment for each pixel is carried out based on the pixels arranged in a surrounding area of predetermined size around it.

Because the assignment of the respective pixels to useful information or interference information is carried out not only on the basis of the color of the pixel but on the basis of a defined environment of the respective pixel, the foreground and background can be cleanly separated even if, in some cases, the same mixed colors occur in the characters to be recognized and in the background printing. As a result, optimal suppression of the background is achieved and at the same time thinning out of the characters to be recognized is avoided, so that the quality of the automatic character recognition is improved.

An advantageous embodiment of the method according to the invention is characterized in that a square search matrix with an odd number of rows and columns is selected as the surrounding area, the pixel to be assigned being located in the center of the search matrix for each assignment. Such a search matrix can be moved step by step in the row and column direction over the image area to be examined, each pixel once becoming the center of the scanning window formed by the search matrix.
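The stepwise movement of the search matrix can be sketched as a generator that places every pixel once at the center of an odd-sized square window. The name `search_windows` and the handling of positions outside the image (filled with `None`) are assumptions made for this illustration; the source does not specify how image borders are treated.

```python
def search_windows(image, size=5):
    """Yield (row, col, window) with pixel (row, col) at the window center.

    Positions outside the image are filled with None (assumed border rule).
    """
    if size % 2 == 0:
        raise ValueError("search matrix needs an odd number of rows and columns")
    half = size // 2
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            window = [
                [
                    image[r + dr][c + dc]
                    if 0 <= r + dr < rows and 0 <= c + dc < cols
                    else None
                    for dc in range(-half, half + 1)
                ]
                for dr in range(-half, half + 1)
            ]
            yield r, c, window


image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
windows = list(search_windows(image, size=3))
```

An odd side length guarantees that the window has a unique center cell, which is why an even size is rejected.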

An advantageous embodiment of the method according to the invention is characterized in that the number of rows and columns of the search matrix is chosen to be small in relation to the number of rows and columns of the image area, which increases the processing speed when examining the pixels arranged within the search matrix.

An advantageous embodiment of the method according to the invention is characterized in that, with each assignment, each of the pixels arranged within the search matrix is allocated to a background area, a tolerance area or a drawing area, the areas being obtained by subdivision of a predetermined color space. In this way, a differentiated evaluation of the pixels with regard to their belonging to the background of the image area is made possible, and the exact criteria for the allocation can be defined depending on the problem.

An advantageous embodiment of the method according to the invention is characterized in that, with each assignment, each of the pixels arranged within the search matrix is allocated to a background area, a tolerance area, a drawing area or a foreground area, the areas being obtained by subdivision of a predetermined color space. In addition to suppressing the background, the additional formation of a foreground area in the predetermined color space also enables the foreground, i.e. the characters to be recognized, to be extracted directly from the image area under consideration.

An advantageous embodiment of the method according to the invention is characterized in that the background area is assigned colors which do not represent mixed colors with a color contained in the characters to be recognized. In this way, pixels can be classified that are definitely part of the background of the image area.

An advantageous embodiment of the method according to the invention is characterized in that the tolerance area is assigned those colors whose saturation and brightness values lie in a predetermined fluctuation range around the background area. In this way, when examining the pixels within the search matrix, the pixels can be divided in a manner that is more differentiated than mere binarization and that can be defined depending on the problem. The tolerance area can include, for example, fluctuations in saturation of ± 10% and fluctuations in brightness of ± 5%.

An advantageous embodiment of the method according to the invention is characterized in that the drawing area is assigned those colors that belong neither to the background area nor to the tolerance area. In this way, pixels can be identified that lie outside the saturation and brightness fluctuations that define the tolerance area.

An advantageous embodiment of the method according to the invention is characterized in that the foreground area is assigned those colors which are contained only in the characters to be recognized. In this way, those pixels can be classified that certainly do not belong to the background, which enables the characters to be recognized to be extracted directly.

An advantageous embodiment of the method according to the invention is characterized in that, with each assignment, the pixel located in the center of the search matrix is assigned to the interference information if the pixel has been allocated to the background area, or if the pixel has been allocated to the tolerance area and at least one of its immediate neighboring pixels has been allocated to the background area, or if the pixel has been allocated to the tolerance area, at least one of the pixels within the search matrix has been allocated to the background area, and all pixels arranged on at least one predetermined connecting line between these pixels have been allocated to either the background area or the tolerance area. As a result, the pixels located within the search matrix are analyzed as effectively as possible, and the pixel located in the center of the search matrix is assigned as reliably as possible to interference information or useful information.

An advantageous embodiment of the method according to the invention is characterized in that, with each assignment, the pixel located in the center of the search matrix is assigned to the useful information if the pixel was allocated to the foreground area. In this way, those pixels that are definitely not part of the background can be extracted immediately. Thinning of the characters is thereby largely avoided, particularly in the case of a relatively dark background and comparatively weak printing of the characters to be recognized.

An advantageous embodiment of the method according to the invention is characterized in that the assignment of a pixel to the useful information takes priority over an assignment to the interference information. In this way, a mechanism for protecting the writing can be achieved.

An advantageous embodiment of the method according to the invention is characterized in that the shape and length of the connecting lines and the size of the search matrix are defined at the beginning of the character recognition. This definition enables a wide range of adaptations to the respective problem.

An advantageous embodiment of the method according to the invention is characterized in that logical operations are used to determine whether the pixels arranged on the at least one predetermined connecting line are allocated to the background area or the tolerance area. Such an embodiment is particularly simple and inexpensive to implement.

An advantageous embodiment of the method according to the invention is characterized in that all permissible connecting lines are checked simultaneously. However, all permissible connecting lines can also be checked within one shift cycle. As a result, when using modern programmable components or ASICs, a processing speed in the range of more than 100 MPixel / s can be achieved inexpensively.

An advantageous embodiment of the method according to the invention is characterized in that each pixel assigned to the interference information is replaced by a known background value. In this way, the method according to the invention can be used as a preprocessing stage for a binarization algorithm.

An advantageous embodiment of the method according to the invention is characterized in that a binarization is carried out directly on the basis of the assignment of a pixel to useful information or interference information. This enables the classification mechanism on which the method according to the invention is based to be used directly.

The device according to the invention for automatic character recognition, having a device for dividing an image area comprising the characters to be recognized into a plurality of pixels arranged in rows and columns, wherein each pixel can be assigned either to useful information to be attributed to the characters to be recognized or to interference information to be suppressed, is characterized in that the assignment for each pixel can be carried out based on the pixels arranged in a surrounding area of predetermined size around it.

The invention is explained in more detail below with reference to the exemplary embodiments shown in the attached figures, in which:

FIG. 1 shows a flowchart of a method according to the invention for automatic character recognition;

FIG. 2 shows a search matrix positioned on an image area with pixels allocated to different areas;

FIG. 3 shows a search matrix with a selection of possible connecting lines;

FIG. 4 shows an image area after application of a first embodiment of the method according to the invention, prior to binarization;

FIG. 5 shows the image area from FIG. 4 after binarization;

FIG. 6 shows an image area after application of a second embodiment of the method according to the invention, after binarization;

FIG. 7 shows an original of an image area comprising several characters;

FIG. 8 shows the image area from FIG. 7 after using a known method for automatic character recognition;

FIG. 9 shows an original of a further image area comprising various characters;

FIG. 10 shows the image area from FIG. 9 after using a known method for automatic character recognition.

According to FIG. 1, the method for automatic character recognition according to the invention begins in a first step 10 with a division of an image area comprising the characters to be recognized into a plurality of pixels arranged in rows and columns. In the course of the method, each of these pixels is assigned either to useful information to be attributed to the characters to be recognized or to interference information to be suppressed.

The size of a search matrix is then defined in a step 20. In the course of the method according to the invention, this search matrix is moved step by step in the row and column direction over the image area to be examined. The search matrix is a scanning window constructed in the form of a matrix, with a preferably small number of columns and rows relative to the image size, which optimizes the processing speed for the pixels arranged within the search matrix.

Then, in a step 30, any color space (e.g. gray, RGB, HLS, YCrCb, LAB, YIQ, CMYK, ...) is divided into three areas. A first, so-called background area is assigned those colors that are certainly part of the background and therefore do not belong to any of the characters to be recognized on the image area to be processed. Colors that represent mixed colors between the color of the background and the color of the characters to be recognized must therefore be excluded here.

A so-called tolerance range is assigned to those colors which, with regard to their brightness and saturation values, lie within a predetermined fluctuation range around the colors of the background. For example, those colors can be assigned to the tolerance range whose saturation values do not deviate from the colors of the background area by more than 10% and the brightness values by not more than 5%.

Colors that have not been assigned to the background or tolerance range are assigned to the so-called drawing range.
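The division into the three areas can be sketched as a small classifier. Several points are assumptions made for this illustration and are not specified in the source: colors are given as (hue in degrees, saturation in percent, brightness in percent) with integer values, the background area is a set of known background colors, the ± 10% and ± 5% limits are read as percentage points, and a tolerance color must share the hue of a background color.

```python
BACKGROUND, TOLERANCE, DRAWING = 0, 1, 2


def classify(color, background_colors, sat_tol=10, bright_tol=5):
    """Allocate a color to the background, tolerance or drawing area.

    color: (hue, saturation, brightness) tuple of integers (assumed format).
    background_colors: set of colors that certainly belong to the background.
    """
    if color in background_colors:
        return BACKGROUND
    hue, sat, bright = color
    for b_hue, b_sat, b_bright in background_colors:
        # Assumption: only saturation and brightness may fluctuate, hue may not.
        if (
            hue == b_hue
            and abs(sat - b_sat) <= sat_tol
            and abs(bright - b_bright) <= bright_tol
        ):
            return TOLERANCE
    # Everything that is neither background nor tolerance is drawing.
    return DRAWING


green = (120, 60, 50)
background = {green}
classes = [
    classify((120, 60, 50), background),  # the background color itself
    classify((120, 68, 53), background),  # slightly different print quality
    classify((120, 75, 50), background),  # saturation too far off
    classify((0, 0, 40), background),     # gray writing
]
```

In a real form, the background set would hold every color known to occur only in the background printing; the single green value here only keeps the example short.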

Depending on the affiliation of the pixels of the image area to the three areas mentioned, steps 40 to 100 of the method according to the invention are then carried out according to predetermined rules, the individual pixels of the image area being positioned row by row and column by column in the center of the search matrix in order to assign them to useful or interference information. For better clarity, an example of an image area to be processed and a search matrix positioned on it are shown in FIGS. 2 and 3, the individual pixels according to FIG. 2 being allocated to the three areas mentioned.

First, permissible connecting lines are defined in a step 40. These are used in the subsequent application of the rules for assigning the pixels to useful information or interference information, as will be explained in more detail below. A selection of possible connecting lines using the example of a 5x5 search matrix is shown in FIG. 3. However, any other connecting lines can be defined depending on the problem.
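One possible way to represent connecting lines is as lists of pixel offsets relative to the center of the search matrix. The sketch below generates only straight lines in the eight principal directions; this choice, the function name `straight_lines` and the offset representation are assumptions for illustration, since the lines actually shown in FIG. 3 are not reproduced in the text and other shapes may be defined.

```python
def straight_lines(half=2):
    """Return connecting lines for a square search matrix of side 2*half + 1.

    Each line is a list of (d_row, d_col) offsets leading from the center
    (exclusive) to a target pixel (inclusive) along one of 8 directions.
    """
    directions = [
        (-1, -1), (-1, 0), (-1, 1),
        (0, -1), (0, 1),
        (1, -1), (1, 0), (1, 1),
    ]
    lines = []
    for d_row, d_col in directions:
        for length in range(1, half + 1):
            lines.append(
                [(d_row * step, d_col * step) for step in range(1, length + 1)]
            )
    return lines


lines = straight_lines(half=2)  # lines for a 5x5 search matrix
```

Limiting `length` corresponds to limiting the maximum length of the permissible connecting lines, and curved or angled lines could be added as further offset lists.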

In a subsequent step 50, a pixel of the image area is first positioned in the center of the search matrix. In the example shown in FIG. 2, a search matrix with a row and column length of 5 pixels is positioned on the image area, with pixels allocated to the background area shown hatched, pixels allocated to the tolerance area shown in white, and pixels allocated to the drawing area shown in black. Furthermore, two permissible connecting lines are drawn, one as a solid line and one as a dotted line. Each time a new pixel is positioned in the center of the search matrix in step 50, this pixel is assigned, on the basis of the rules provided in steps 60, 70 and 80, either to the interference information to be suppressed in a step 90 or to the useful information to be attributed to the characters to be recognized in a step 100 according to FIG. 1, whereupon the next pixel in row and column order is positioned in the center of the search matrix.

The assignment of the pixel positioned in the center of the search matrix, hereinafter referred to as the center point pixel, is carried out according to FIG. 1 as follows:

First, step 60 queries whether the midpoint pixel belongs to the background area. If this is the case, the center point pixel is interpreted as the background and is consequently assigned directly to the interference information to be suppressed. However, if the midpoint pixel does not belong to the background area, a query is made in step 70 as to whether the midpoint pixel belongs to the tolerance range and whether at least one of the immediately adjacent pixels, of which there are 8 in total, belongs to the background area. If this is affirmed, the center point pixel is also assigned to the interference information to be suppressed.

If the center point pixel belongs to the tolerance range but none of its immediate neighboring pixels belongs to the background area, as is the case in the exemplary embodiment shown in FIG. 2, the process continues with step 80. In this step it is queried whether the center point pixel belongs to the tolerance range and at least one of the pixels located in the search matrix belongs to the background area. In the example shown in FIG. 2, the latter is the case for two pixels, which are referred to below as target pixels. In this case, for all permissible connecting lines that were defined in step 40, the pixels arranged on the connecting line between the center point pixel and the respective target pixel are checked to determine whether they belong to the background area or the tolerance area. If there is at least one permissible connecting line on which all the pixels between the center point pixel and the target pixel belong to the background area or the tolerance area, which can be checked by simple logic operations, then the center point pixel is also assigned in step 90 to the interference information to be suppressed. If this is not the case, however, the center point pixel is assigned in step 100 to the useful information attributable to the characters to be recognized. In both cases, the next pixel of the image area is then positioned in the center of the search matrix and assigned according to the same rules of steps 60, 70 and 80.
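The queries of steps 60, 70 and 80 can be sketched as a single function. This is a simplified illustration under stated assumptions, not the patented implementation: the image is assumed to be already converted into a map of class codes, connecting lines are lists of (row, column) offsets from the center with the target pixel last, and positions outside the image count as neither background nor tolerance. All names are hypothetical.

```python
BACKGROUND, TOLERANCE, DRAWING = 0, 1, 2


def is_interference(classes, row, col, lines):
    """Return True if the center point pixel is assigned to interference."""
    rows, cols = len(classes), len(classes[0])

    def cls(r, c):
        # Assumed border rule: outside the image there is no class at all.
        return classes[r][c] if 0 <= r < rows and 0 <= c < cols else None

    center = cls(row, col)
    if center == BACKGROUND:          # step 60
        return True
    if center != TOLERANCE:           # drawing pixels are never suppressed
        return False
    for dr in (-1, 0, 1):             # step 70: the 8 immediate neighbors
        for dc in (-1, 0, 1):
            if (dr, dc) != (0, 0) and cls(row + dr, col + dc) == BACKGROUND:
                return True
    for line in lines:                # step 80: permissible connecting lines
        path = [cls(row + dr, col + dc) for dr, dc in line]
        target_ok = path[-1] == BACKGROUND
        between_ok = all(p in (BACKGROUND, TOLERANCE) for p in path[:-1])
        if target_ok and between_ok:
            return True
    return False


B, T, D = BACKGROUND, TOLERANCE, DRAWING
class_map = [[B, T, T, D, T, D, B]]
lines = [[(0, -1), (0, -2)], [(0, 1), (0, 2)]]  # two horizontal lines
mask = [is_interference(class_map, 0, c, lines) for c in range(7)]
```

In this one-row example, the tolerance pixel in column 2 is suppressed through a connecting line to the background pixel in column 0, whereas the tolerance pixel in column 4 is kept because a drawing pixel interrupts its only line to a background pixel.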

The fact that the maximum length and shape of the permissible connecting lines as well as the size of the search matrix can be individually defined means that various adaptations to the respective problem are possible.

The pixels assigned to the interference information to be suppressed can each be replaced by a known background value, so that the method according to the invention can be used as a preprocessing stage for a binarization algorithm. However, due to its classification mechanism, the method according to the invention is also suitable for binarization on colored image areas with regard to the separation of foreground and background.
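Both uses of the assignment can be sketched in a few lines. The sketch assumes that the assignment is available as a Boolean mask (True for interference information) and that 255 stands for the known background value; both are illustrative assumptions.

```python
def suppress_background(image, interference, background_value=255):
    """Replace every interference pixel by a known background value."""
    return [
        [background_value if masked else value
         for value, masked in zip(image_row, mask_row)]
        for image_row, mask_row in zip(image, interference)
    ]


def binarize_from_assignment(interference):
    """Binarize directly: 1 for useful information, 0 for interference."""
    return [[0 if masked else 1 for masked in mask_row]
            for mask_row in interference]


image = [[200, 90, 180], [60, 210, 70]]
interference = [[True, False, True], [False, True, False]]
cleaned = suppress_background(image, interference)
binary = binarize_from_assignment(interference)
```

The first function leaves the useful pixels unchanged, so any conventional binarization algorithm can be run on its output afterwards.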

FIG. 4 shows the result of the method according to the invention, applied to the image area shown in FIG. 7. As a result of the environment-related separation of foreground and background achieved by the method according to the invention, thinning out of the characters to be recognized is effectively avoided while the entire background is suppressed. As a result of the differentiated division of the pixels into the three areas mentioned, a clean separation of foreground and background is achieved even if the same mixed colors occur in part both in the background and in the characters to be recognized. As shown in FIG. 5, a subsequent binarization leads to a correct reproduction of the characters to be recognized.

The method according to the invention was used in the exemplary embodiment described above to suppress the background, but can alternatively also be used for direct extraction of the characters to be recognized or for a combination of suppression of the background and extraction of the characters to be recognized, as will be explained below.

In this case, the color space in step 30 of the flowchart shown in FIG. 1 is divided into four instead of three areas, the fourth area, the so-called foreground area, comprising those colors that are contained only in the characters to be recognized, that is to say that certainly do not belong to the background. If pixels are allocated to this foreground area, these pixels can be assigned to the useful information immediately in accordance with step 100. Such an assignment rule can be used in the course of the method in addition to the assignment rules of steps 60, 70 and 80 in order to combine the suppression of the background of the image area with a direct extraction of the foreground. Such a combination is particularly appropriate if, in the case of weakly printed characters on a relatively dark background, severe thinning of the characters is to be prevented.

The assignment to the foreground area for the purpose of extracting the characters can, however, also be given priority over suppressing the background, which means that a protection mechanism can be achieved.

FIG. 6 shows the result of applying the method according to the invention to the image area shown in FIG. 9, wherein, as described above, the foreground was extracted and then binarized. Here, too, there is a clean separation of foreground and background as a result of the environment-related character recognition according to the invention.

The method according to the invention is very simple and inexpensive to implement in a device for automatic character recognition. For each possible color in the color space, a two-bit-wide entry is sufficient for the classification into background area, tolerance area and drawing area. The required search matrix is also two bits deep. All permissible connecting lines can be checked simultaneously or within one shift cycle. When using modern programmable components or ASICs, a processing speed of more than 100 Mpixel/s can be achieved. Using simple logical operations, it can be checked whether a permissible connecting line between a center point pixel and a target pixel fulfills the prerequisite for the assignment to the useful information or to the interference information to be suppressed.
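The two-bit classification table and the check by logical operations can be imitated in software as follows. The bit encoding is an assumption chosen for this sketch (bit 1 set means background or tolerance, bit 0 set means background), as are the 8-bit gray color space and the threshold values; the source only states that a two-bit entry per color suffices.

```python
BACKGROUND, TOLERANCE, DRAWING = 0b11, 0b10, 0b00


def build_lut(background_min=200, tolerance_min=180):
    """Build a lookup table with one two-bit class code per gray value."""
    lut = bytearray(256)
    for gray in range(256):
        if gray >= background_min:
            lut[gray] = BACKGROUND
        elif gray >= tolerance_min:
            lut[gray] = TOLERANCE
        else:
            lut[gray] = DRAWING
    return lut


def line_permits_suppression(codes):
    """Check one connecting line using AND operations only.

    codes: class codes of the pixels on the line, target pixel last.
    """
    passable = 0b10
    for code in codes[:-1]:
        passable &= code  # stays 0b10 only if every pixel is B or T
    return bool((passable >> 1) & codes[-1] & 1)


lut = build_lut()
open_line = line_permits_suppression([lut[190], lut[185], lut[220]])
blocked_line = line_permits_suppression([lut[190], lut[50], lut[220]])
no_target = line_permits_suppression([lut[190], lut[190]])
```

With such an encoding, each line reduces to a chain of AND gates, which suggests how all permissible lines could be evaluated in parallel in programmable logic.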

Claims


1. A method for automatic character recognition, wherein an image area comprising the characters to be recognized is subdivided into a plurality of pixels arranged in rows and columns and each pixel is assigned either to useful information to be attributed to the characters to be recognized or to interference information to be suppressed, characterized in that the assignment for each pixel is carried out based on the pixels arranged in a surrounding area of predetermined size around it.

2. The method according to claim 1, characterized in that a square search matrix with an odd number of rows and columns is selected as the surrounding area, with the respective pixel to be assigned being located in the center of the search matrix for each assignment.

3. The method according to claim 2, characterized in that the number of rows and columns of the search matrix is chosen to be small in relation to the number of rows and columns of the image area.

4. The method according to claim 2 or 3, characterized in that, with each assignment, each of the pixels arranged within the search matrix is allocated to a background area, a tolerance area or a drawing area, the areas being obtained by subdivision of a predetermined color space.

5. The method according to claim 2 or 3, characterized in that, with each assignment, each of the pixels arranged within the search matrix is allocated to a background area, a tolerance area, a drawing area or a foreground area, the areas being obtained by subdivision of a predetermined color space.

6. The method according to claim 4 or 5, characterized in that the background area is assigned such colors that do not represent mixed colors with a color contained in the characters to be recognized.

7. The method according to any one of claims 4 to 6, characterized in that the tolerance range is assigned such colors whose saturation and brightness values lie in a predetermined fluctuation range around colors of the background range.

8. The method according to claim 7, characterized in that the tolerance range comprises saturation fluctuations of ± 10% and brightness fluctuations of ± 5%.

9. The method according to any one of claims 4 to 8, characterized in that the drawing area is assigned those colors that belong neither to the background area nor to the tolerance area.

10. The method according to any one of claims 5 to 9, characterized in that such colors are assigned to the foreground area that are only contained in the characters to be recognized.

11. The method according to any one of claims 4 to 10, characterized in that with each assignment, the pixel located in the center of the search matrix is assigned to the interference information if the pixel has been allocated to the background area, or if the pixel has been allocated to the tolerance area and at least one of its immediate neighboring pixels has been allocated to the background area, or if the pixel has been allocated to the tolerance area, at least one of the pixels within the search matrix has been allocated to the background area, and all pixels arranged on at least one predetermined connecting line between these pixels have been allocated to either the background area or the tolerance area.

12. The method according to any one of claims 5 to 11, characterized in that with each assignment the pixel located in the center of the search matrix is assigned to the useful information if the pixel was assigned to the foreground area.

13. The method according to claim 12, characterized in that the assignment of a pixel to the useful information takes priority over an assignment to the interference information.

14. The method according to any one of claims 11 to 13, characterized in that at the beginning the shape and length of the permissible connecting lines as well as the size of the search matrix are determined.

15. The method according to any one of claims 11 to 14, characterized in that it is determined by logical links whether the pixels arranged on the at least one predetermined connecting line are assigned to either the background area or the tolerance area.

16. The method according to claim 15, characterized in that all permissible connecting lines are checked simultaneously.

17. The method according to claim 15, characterized in that all permissible connecting lines are checked within one shift cycle.

18. The method according to any one of claims 1 to 17, characterized in that each pixel associated with the interference information is replaced by a known background value.

19. The method according to any one of claims 1 to 17, characterized in that a binarization is carried out directly on the basis of the assignment of a pixel to useful information or interference information.

20. Device for automatic character recognition, comprising a device for dividing an image area comprising the characters to be recognized into a plurality of pixels arranged in rows and columns, wherein each pixel can be assigned either to useful information attributable to the characters to be recognized or to interference information to be suppressed, characterized in that the assignment for each pixel can be carried out based on the pixels arranged in a surrounding area of predetermined size around it.