CN105447489A

CN105447489A - Character and background adhesion noise elimination method for image OCR system

Info

Publication number: CN105447489A
Application number: CN201510776907.1A
Authority: CN
Inventors: 王�忠; 周庆标; 覃方颖
Original assignee: Zhejiang University of Media and Communications
Current assignee: Zhejiang University of Media and Communications
Priority date: 2015-11-13
Filing date: 2015-11-13
Publication date: 2016-03-30
Anticipated expiration: 2035-11-13
Also published as: CN105447489B

Abstract

The invention discloses a character and background adhesion noise elimination method for an image OCR system, comprising the following steps: (1) setting the effective range of the stroke length of a character to be identified; (2) selecting an image containing the character to be identified in a natural environment, and calculating the difference boundary diagram of the image by means of mean difference template; (3) thresholding the obtained difference boundary diagram of the image to be identified to form a ternary-valued boundary diagram of the image to be identified; and (4) carrying out searching in the ternary-valued boundary diagram according to the angle-of-inclination range and the linear boundary. By using the character and background adhesion noise elimination method for the image OCR system of the invention, the problem that a character with adhesion noise is hard to position for the existing OCR application system is solved in a complex and changeable natural environment, the character position can be extracted accurately, and the real-time requirements of engineering application can be satisfied.

Description

A method for eliminating the adhesion noise between characters and background in an image OCR recognition system

技术领域technical field

本发明涉及自然场景图片OCR识别系统，尤其涉及一种图片OCR识别系统的字符与背景粘连噪声消除方法。The invention relates to a natural scene picture OCR recognition system, in particular to a method for eliminating the adhesion noise between characters and background in the picture OCR recognition system.

背景技术Background technique

在人工智能领域，光学字符识别OCR是一项十分重要的技术，由于智能手机的普及以及云存储中海量图片的分类与搜索的需要，使得自然场景图片OCR识别成为近年来研究的一个热点，OCR技术一般包括字符定位，字符切割，字符识别等几个过程，其中字符定位的速度与精度，直接影响OCR识别技术的好坏，是整个OCR系统的关键。In the field of artificial intelligence, optical character recognition (OCR) is a very important technology. Due to the popularity of smart phones and the need for classification and search of massive images in cloud storage, OCR recognition of natural scene images has become a hot research topic in recent years. OCR Technology generally includes several processes such as character positioning, character cutting, and character recognition, among which the speed and accuracy of character positioning directly affect the quality of OCR recognition technology and are the key to the entire OCR system.

OCR字符定位通常利用字符串的结构信息，通过全局搜索的方式对字符串所在区域进行定位，而字符串结构信息的提取，最常用也是较有效的方法是通过二值化技术来提取字符串图像的边缘，常用的二值化方法有固定阈值法、自适应阈值法、全局阈值法以及局部阈值法等，无论哪一种算法，当面对复杂多变的环境，如不同季节、不同天气环境等复杂情况，都会或多或少的引入各种各样的噪声，这些噪声将很容易导致定位失败，或产生大量虚假字符信息，从而使得后续处理的计算量大大增加。OCR character positioning usually uses the structural information of the string to locate the area where the string is located through a global search, and the most common and effective method for extracting the string structure information is to extract the string image through binarization technology The commonly used binarization methods include fixed threshold method, adaptive threshold method, global threshold method and local threshold method. Various noises will be introduced more or less in complicated situations, which will easily lead to positioning failure, or generate a large amount of false character information, thus greatly increasing the calculation amount of subsequent processing.

对二值化所引入的噪声，现有的做法通常是采用一些滤波方法去滤除噪声，例如中值滤波、数学形态学、二维小波分析等，这些不同的方法对不同的图像有不同的效果，常用的线性低通滤波器和邻域平均的方法虽然可以去除部分噪声，但它们具有图像模糊的负作用，中值滤波的方法可以消除孤立的噪声点，而且产生的模糊比较少，但是它对二值图像去除噪声的效果并不好，数学形态学在一定程度上可以将部分黑色斑块腐蚀掉，但这样往往会导致原图像变形加剧，不利于后续的计算。For the noise introduced by binarization, the existing method usually uses some filtering methods to filter out the noise, such as median filtering, mathematical morphology, two-dimensional wavelet analysis, etc. These different methods have different effects on different images. Effect, although the commonly used linear low-pass filter and neighborhood averaging method can remove part of the noise, but they have the negative effect of image blur, the median filtering method can eliminate isolated noise points, and the blurring produced is relatively small, but It is not effective in removing noise from binary images. Mathematical morphology can corrode some black patches to a certain extent, but this often leads to aggravated deformation of the original image, which is not conducive to subsequent calculations.

另外，在自然场景图片OCR识别中，由于光照、天气、杂物等因素的影响，使得二值化噪声更多，实际工程应用中，独立的、离散的噪声往往比较容易区分，而与字符或者待识别对象发生了粘连的噪声往往难以处理，粘连噪声带来的结果就是相应的字符位置范围受到了干扰，导致字符的准确定位失败，从而影响了系统的整体识别率，如何消除自然场景中的字符粘连噪声成了OCR识别中的一个关键问题，因此，提出一种能够满足实际应用需要的、实现各种复杂环境下的实时字符粘连噪声消除技术显得十分重要，通过消除粘连噪声，可减少非字符区域的干扰，从而提供OCR字符定位的准确性。In addition, in the OCR recognition of natural scene pictures, due to the influence of factors such as light, weather, and sundries, there are more binarized noises. In practical engineering applications, independent and discrete noises are often easier to distinguish, while characters or It is often difficult to deal with the noise caused by the adhesion of the objects to be recognized. The result of the adhesion noise is that the corresponding character position range is disturbed, resulting in the failure of accurate positioning of the characters, which affects the overall recognition rate of the system. How to eliminate the noise in natural scenes? Character adhesion noise has become a key problem in OCR recognition. Therefore, it is very important to propose a real-time character adhesion noise elimination technology that can meet the needs of practical applications and realize various complex environments. The interference of the character area, thus improving the accuracy of OCR character positioning.

发明内容Contents of the invention

本发明的目的在于提供一种图片OCR识别系统的字符与背景粘连噪声消除方法，针对现有自然场景图片的OCR技术，在噪声处理方面存在的不足，提出了一种新的粘连噪声消除方法，在复杂多变的自然环境中，该方法解决了现有OCR应用系统对具有粘连噪声字符定位困难的问题，实现字符位置的准确提取，并能满足工程应用的实时性要求。The purpose of the present invention is to provide a method for eliminating the cohesive noise between characters and the background of a picture OCR recognition system. Aiming at the OCR technology of existing natural scene pictures, there are deficiencies in noise processing, and a new method for eliminating cohesive noise is proposed. In the complex and changeable natural environment, this method solves the problem that the existing OCR application system has difficulty in locating characters with cohesive noise, realizes accurate extraction of character positions, and can meet the real-time requirements of engineering applications.

为了达到上述目的，本发明提供的技术方案是：所述的自然场景图片OCR识别系统的字符与背景粘连噪声消除方法包括按顺序执行的下列步骤：In order to achieve the above object, the technical solution provided by the present invention is: the character and background cohesion noise elimination method of the natural scene picture OCR recognition system includes the following steps executed in order:

一种图片OCR识别系统的字符与背景粘连噪声消除方法，所述的图片OCR识别系统的字符与背景粘连噪声消除方法包括按如下步骤：A character and background cohesion noise elimination method of a picture OCR recognition system, the character and background cohesion noise elimination method of the picture OCR recognition system comprises the following steps:

1)根据OCR系统所应用的工程环境的先验知识，设置待识别字符的笔划长度StrokeLen的有效范围；1) According to the prior knowledge of the engineering environment applied by the OCR system, the effective range of the stroke length StrokeLen of the character to be recognized is set;

2)选取自然环境下的含有待识别字符的图像，并以均值差分模板的方式计算图像差分边界图，所述差分边界图包含字符区域和背景区域；2) Select an image containing a character to be recognized in the natural environment, and calculate the image difference boundary map in the form of a mean difference template, and the difference boundary map includes a character area and a background area;

3)将上述得到的待识别图像的差分边界图进行阈值化，形成待识别图像的三值化边界图，所述三值化边界图包含待识别区域和背景区域；3) Thresholding the differential boundary map of the image to be recognized obtained above to form a three-valued boundary map of the image to be recognized, the three-valued boundary map includes a region to be recognized and a background region;

4)在上述得到的待识别图像的三值化边界图中，按照倾斜角度范围内、直线型边界进行搜索，检查该边界是否超出了笔划长度的有效范围，当没有超出笔划长度的有效范围，则认为是有效的边界点，并给予保留，当超出笔划长度的有效范围，则该边界判定为噪声，将其从三值边界图像中清除。4) In the ternary boundary map of the image to be recognized obtained above, search according to the slope angle range and the linear boundary, and check whether the boundary exceeds the valid range of the stroke length. If it does not exceed the valid range of the stroke length, Then it is considered as a valid boundary point and is reserved. When it exceeds the effective range of the stroke length, the boundary is judged as noise, and it is removed from the ternary boundary image.

所述步骤3)中，将所述待识别图像的差分边界图进行阈值化的公式如下： $T h r e s h o l d (P (x, y)) = \{\begin{matrix} c 1, & i f P (x, y) < P a r a m A \\ c 2, & i f P (x, y) > P a r a m B \\ c 3, & O t h e r s \end{matrix}$ In the step 3), the formula for thresholding the differential boundary map of the image to be recognized is as follows: $T h r e the s h o l d (P (x, the y)) = \{\begin{matrix} c 1, & i f P (x, the y) < P a r a m A \\ c 2, & i f P (x, the y) > P a r a m B \\ c 3, & o t h e r the s \end{matrix}$

其中，P(x，y)是差分图中像素点(x，y)的差分值，ParamA和ParamB是预先设置的阈值，c1，c2和c3是无符号整数，取值范围是[0，255]；其中c1表示当前像素点的亮度比周围更亮的边界，c2表示当前像素点的亮度比较周围更暗的边界，c3表示非边界的取值。Among them, P(x, y) is the difference value of the pixel point (x, y) in the difference map, ParamA and ParamB are preset thresholds, c1, c2 and c3 are unsigned integers, and the value range is [0, 255 ]; where c1 represents the border where the brightness of the current pixel is brighter than the surrounding, c2 represents the border where the brightness of the current pixel is darker than the surrounding, and c3 represents the value of the non-border.

所述的c1颜色为黑色，取值为c1＝0；所述的c2颜色为白色，取值为c2＝255，所述的c3颜色为灰色，取值为c3＝128，差分边界像素点的值分别为0、255，而非边界像素点的值为128，即所述的黑色和白色的像素点为边界像素点，所述的灰色的像素点是非边界像素点。The color of c1 is black and its value is c1=0; the color of c2 is white and its value is c2=255; the color of c3 is gray and its value is c3=128; The values are 0 and 255 respectively, and the value of the non-boundary pixels is 128, that is, the black and white pixels are boundary pixels, and the gray pixels are non-boundary pixels.

所述步骤4)中，将所述待识别图像的三值化图像按照倾斜角度范围内、直线型边界进行搜索的算法步骤如下：In the step 4), the algorithm steps of searching the three-valued image of the image to be recognized according to the linear boundary in the range of oblique angles are as follows:

4.1)计算倾斜角度在[α，β]范围内的直线倾斜偏移量检测模版；4.1) Calculate the linear tilt offset detection template with the tilt angle in the range of [α, β];

4.2)对三值化图像中每一个边界点，利用步骤a)中得到的直线检测模版，检查从该边界点出发、边界值相同，并且与模版匹配的连续边界线；4.2) For each boundary point in the three-valued image, use the straight line detection template obtained in step a) to check the continuous boundary line starting from the boundary point, having the same boundary value and matching the template;

4.3)对匹配的连续边界线，检查其长度是否属于有效范围，当属于有效范围，则保留，否则该边界线为噪声边界，并将边界线上所有边界点所包含的边界像素值设置为c3。4.3) For the matched continuous boundary line, check whether its length belongs to the valid range, if it belongs to the valid range, keep it, otherwise the boundary line is a noise boundary, and set the boundary pixel values contained in all boundary points on the boundary line to c3 .

所述步骤4-1)中，所述计算倾斜角度在[α，β]范围内的直线检测模版的算法步骤如下：In the step 4-1), the algorithm steps for calculating the straight line detection template with an inclination angle in the range of [α, β] are as follows:

4.1.1)根据步骤1)所设置的最长笔划长度StrokeLen，按照工程实际情况计算直线模版数量Num、直线模版线条的长度Len以及该模版在y方向上的最大偏移值MaxOffY，计算公式分别如下；4.1.1) According to the longest stroke length StrokeLen set in step 1), calculate the number of straight-line templates Num, the length Len of the straight-line template lines and the maximum offset value MaxOffY of the template in the y direction according to the actual situation of the project. The calculation formulas are respectively as follows;

$\{\begin{matrix} N N u u m m = = 44 S S t t r r o o k k e e L L e e n no \\ L L e e n no = = 22 S S t t r r o o k k e e L L e e n no \\ M m a a x x O o f f f f Y Y = = L L e e n no \end{matrix}$

4.1.2)对每一个直线倾斜偏移量模版，按如下公式计算y方向的偏移值。4.1.2) For each linear tilt offset template, calculate the offset value in the y direction according to the following formula.

$y the y = = \frac{x x * * O o f f f f Y Y}{L L e e n no},, x x &Element; &Element; [[00,, L L e e n no]],, O o f f f f Y Y &Element; &Element; [[- - M m a a x x o o f f f f Y Y,, M m a a x x O o f f f f Y Y]]$

所述步骤4.2)中将待识别图像的三值化图像进行直线型边界的搜索算法，判断从某边界点出发、边界值相同，并且与直线倾斜偏移量模版匹配的连续边界线的方法如下：In the step 4.2), the three-valued image of the image to be recognized is subjected to a linear boundary search algorithm, and the method of judging a continuous boundary line starting from a certain boundary point, having the same boundary value, and matching the straight-line tilt offset template is as follows :

4.2.1)当所述直线的线宽为1的普通直线，该方法从出发点开始，从左到右检查是否有直接相邻的等值边界点，当存在，则该直线上所有边界点都属于匹配点，否则就属于不匹配点；4.2.1) When the line width of the straight line is an ordinary straight line of 1, the method starts from the starting point and checks from left to right whether there are directly adjacent equivalent boundary points, and if there is, all boundary points on the straight line are It belongs to the matching point, otherwise it belongs to the non-matching point;

4.2.2)当所述的直线线宽为N，基于八连通的广义直线，该方法从出发点开始，检测八连通意义下的等值边界点，当存在这样的广义直线，则该广义直线上所有边界点都属于匹配点，否则就属于不匹配点。4.2.2) When the line width of the straight line is N, based on the eight-connected generalized straight line, the method starts from the starting point and detects the equivalent boundary point under the eight-connected sense. When there is such a generalized straight line, then on the generalized straight line All boundary points belong to matching points, otherwise they belong to non-matching points.

本发明的有益效果是：本发明的一种图片OCR识别系统的字符与背景粘连噪声消除方法，针对现有自然场景图片的OCR技术，在噪声处理方面存在的不足，提出了一种新的粘连噪声消除方法，在复杂多变的自然环境中，该方法解决了现有OCR应用系统对具有粘连噪声字符定位困难的问题，实现字符位置的准确提取，并能满足工程应用的实时性要求。The beneficial effects of the present invention are: the character and background adhesion noise elimination method of a picture OCR recognition system of the present invention proposes a new adhesion Noise elimination method, in the complex and changeable natural environment, this method solves the problem that the existing OCR application system has difficulty in locating characters with cohesive noise, realizes accurate extraction of character positions, and can meet the real-time requirements of engineering applications.

附图说明Description of drawings

图1为本发明实施例的一种图片OCR识别系统的字符与背景粘连噪声消除方法的流程图；Fig. 1 is the flow chart of the character and background adhesion noise elimination method of a kind of picture OCR recognition system of the embodiment of the present invention;

图2为本发明实施例的均值差分模板；Fig. 2 is the mean difference template of the embodiment of the present invention;

图3为本发明实施例的一种扫描差分边界广义直线的示意图，图中空白的矩形块表示非边界像素点，图中非空白矩形块表示边界像素点；Fig. 3 is a schematic diagram of a generalized straight line of a scanning differential boundary according to an embodiment of the present invention, the blank rectangular blocks in the figure represent non-boundary pixels, and the non-blank rectangular blocks in the figure represent boundary pixels;

图4为本发明实施例对所述三值化图像进行直线型广义直线边界进行搜索的流程图；Fig. 4 is a flow chart of searching for a straight-line generalized straight-line boundary on the three-valued image according to an embodiment of the present invention;

图5为本发明计算倾斜角度在[α，β]范围内的直线检测模版流程图；Fig. 5 is the flow chart of the straight line detection template for calculating the tilt angle in the range of [α, β] according to the present invention;

图6为本发明实施例中由差分边界像素组成的普通直线示意图；FIG. 6 is a schematic diagram of an ordinary straight line composed of differential boundary pixels in an embodiment of the present invention;

图7为本发明为广义直线判断中用到的、基于八连通意义下的相邻像素示意图；Fig. 7 is a schematic diagram of adjacent pixels based on the eight-connected meaning used in the generalized straight line judgment of the present invention;

图8为本发明宽度为2的广义直线示意图；Fig. 8 is a schematic diagram of a generalized straight line with a width of 2 in the present invention;

图9为本发明宽度为3的广义直线示意图；Fig. 9 is a schematic diagram of a generalized straight line with a width of 3 in the present invention;

图10为本发明实施例应用到第1张有粘连的车牌处理前的图片；Fig. 10 is the picture before the embodiment of the present invention is applied to the first license plate with adhesion;

图11为本发明实施例应用到第1张车牌处理后的效果图；Fig. 11 is an effect diagram after the embodiment of the present invention is applied to the first license plate processing;

图12为本发明实施例应用到第2张有粘连的车牌处理前的图片；Fig. 12 is the picture before the embodiment of the present invention is applied to the second license plate with adhesion;

图13为本发明实施例应用到第2张车牌处理后的效果图；Fig. 13 is an effect diagram after the embodiment of the present invention is applied to the second license plate processing;

图14为本发明实施例应用到第3张有粘连的车牌处理前的图片；Fig. 14 is the picture before the embodiment of the present invention is applied to the third license plate with adhesion;

图15为本发明实施例应用到第3张车牌处理后的效果图；Fig. 15 is the effect diagram after the embodiment of the present invention is applied to the processing of the 3rd license plate;

图16为本发明实施例应用到第4张有粘连的车牌处理前的图片；Fig. 16 is the picture before the embodiment of the present invention is applied to the 4th license plate with adhesion;

图17为本发明实施例应用到第4张车牌处理后的效果图；Fig. 17 is the effect diagram after the embodiment of the present invention is applied to the processing of the 4th license plate;

图18为本发明实施例应用到第5张有粘连的车牌处理前的图片；Fig. 18 is the picture before the embodiment of the present invention is applied to the fifth license plate with adhesion;

图19为本发明实施例应用到第5张车牌处理后的效果图。Fig. 19 is an effect diagram after the embodiment of the present invention is applied to the processing of the fifth license plate.

具体实施方式detailed description

下面结合附图对本发明作进一步详细说明。The present invention will be described in further detail below in conjunction with the accompanying drawings.

如图1所示，为本发明实施例一种图片OCR识别系统的字符与背景粘连噪声消除方法的流程图，所述方法包括：As shown in Figure 1, it is a flow chart of a method for eliminating the adhesion noise between characters and background of a picture OCR recognition system according to an embodiment of the present invention, and the method includes:

1)根据OCR系统所应用的工程环境的先验知识，设置待识别字符的笔划长度StrokeLen的有效范围。该参数与实际的工程有密切的关系，实际应用过程中要设置为可配置的参数。当该方法应用到卡口车牌图像的OCR识别时，该参数选择为35个像素。在交通电子警察图片的OCR识别过程中，该参数选择为30个像素。1) According to the prior knowledge of the engineering environment applied by the OCR system, the effective range of the stroke length StrokeLen of the character to be recognized is set. This parameter is closely related to the actual project, and should be set as a configurable parameter in the actual application process. When this method is applied to the OCR recognition of bayonet license plate images, this parameter is selected as 35 pixels. In the OCR recognition process of traffic electronic police pictures, this parameter is selected as 30 pixels.

2)选取自然环境下的含有待识别字符的图像，并以如图2所示的均值差分模板的方式计算图像差分边界图，所述差分边界图包含字符区域和背景区域。2) Select an image containing characters to be recognized in the natural environment, and calculate the image difference boundary map in the form of the mean difference template as shown in Figure 2, and the difference boundary map includes a character area and a background area.

3)将所述待识别图像的差分边界图进行阈值化，形成待识别图像的三值化边界图，所述待识别图像包含待识别区域和背景区域，将所述待识别图像的差分图按照如下公式进行三值化：3) Thresholding the differential boundary map of the image to be recognized to form a three-valued boundary map of the image to be recognized, the image to be recognized includes a region to be recognized and a background region, and the difference map of the image to be recognized is according to The following formula is used for ternaryization:

$T T h h r r e e s the s h h o o l l d d ((P P ((x x \cdot \cdot y the y)))) = = \{\begin{matrix} c c 11,, & i i f f P P ((x x \cdot &Center Dot; y the y)) < < P P a a r r a a m m A A \\ c c 22,, & i i f f P P ((x x,, y the y)) > > P P a a r r a a m m B B \\ c c 33,, & O o t t h h e e r r s the s \end{matrix}$

其中，P(x，y)是差分图中像素点(x，y)的差分值，ParamA和ParamB是预先设置的阈值，c1，c2和c3的取值范围是[0，255]，其中c1表示当前像素点的亮度比周围更亮的边界，c2表示当前像素点的亮度比较周围更暗的边界，c3表示非边界的取值。在工程中，为了方便调试，三个常量的取值应该尽可能有比较大的差异性，所述的c1颜色为黑色，取值为c1＝0；所述的c2颜色为白色，取值为c2＝255，所述的c3颜色为灰色，取值为c3＝128，差分边界像素点的值分别为0、255，而非边界像素点的值为128，即所述的黑色和白色的像素点为边界像素点，所述的灰色的像素点是非边界像素点。Among them, P(x, y) is the difference value of the pixel point (x, y) in the difference map, ParamA and ParamB are preset thresholds, and the value range of c1, c2 and c3 is [0, 255], where c1 Indicates the boundary where the brightness of the current pixel is brighter than the surrounding, c2 indicates the boundary where the brightness of the current pixel is darker than the surrounding, c3 indicates the value of the non-boundary. In the project, in order to facilitate debugging, the values of the three constants should have relatively large differences as much as possible. The color of c1 is black, and the value is c1=0; the color of c2 is white, and the value is c2=255, the color of c3 is gray, the value is c3=128, the values of the difference border pixels are 0, 255 respectively, and the value of non-border pixels is 128, that is, the black and white pixels Points are boundary pixels, and the gray pixels are non-boundary pixels.

4)在上述得到的待识别图像的三值化边界图中，按照倾斜角度范围内、直线型边界进行搜索，检查该边界是否超出了笔划长度的有效范围，如果没有超出笔划长度的有效范围，则认为是有效的边界点并给予保留，如果超出笔划长度的有效范围，则该边界判定为噪声边界，将其从三值边界图像中清除，如图3所示，其中表示出了三次检测过程，分别是从差分边界像素点A、C和E出发，检测了Num个方向的直线型边界，在图3中，有两条直线型差分边界，分别是AB和CD，而E则是一个孤立的边界点，如果AB和CD超出笔划长度的有效范围，就将其颜色值设置为非边界值c2，这样就清除了该直线型边界。4) In the three-valued boundary map of the image to be recognized obtained above, search according to the slope angle range and the straight line boundary, and check whether the boundary exceeds the valid range of the stroke length, if it does not exceed the valid range of the stroke length, It is considered to be an effective boundary point and reserved. If it exceeds the effective range of the stroke length, the boundary is judged as a noise boundary, and it is removed from the three-valued boundary image, as shown in Figure 3, which shows three detection processes , starting from the differential boundary pixel points A, C and E respectively, the linear boundaries in Num directions are detected. In Figure 3, there are two linear differential boundaries, namely AB and CD, and E is an isolated , if AB and CD exceed the valid range of the stroke length, their color values are set to the non-boundary value c2, thus clearing the linear boundary.

经过上述步骤处理后的差分图，就是消除了粘连噪声的图像，此时就可以将其应用到OCR字符识别的后续流程中。The difference image processed by the above steps is the image with the adhesion noise eliminated, and it can be applied to the subsequent process of OCR character recognition at this time.

在上述过程中，直线型边界的搜索尤其重要，如图4所示，为本发明对三值化图像进行直线型边界搜索的流程图，所述流程如下：In the above-mentioned process, the search of linear boundary is particularly important, as shown in Figure 4, it is the flow chart that the present invention carries out linear boundary search to three-valued image, and described process is as follows:

步骤4.1)计算倾斜角度在[α，β]范围内的直线倾斜偏移量检测模版，这里所涉及的两个参数实际上表示了OCR字符的倾斜角度范围，最小倾斜角度是α，最大倾斜角度是β，它们的选择对噪声的清除效果以及性能有重要的影响，实际应用过程中要设置为可配置的参数，当该方法应用到卡口车牌图像的OCR识别时，参数选择可以为α＝-15°，β＝15°，当应用到交通电子警察图片的OCR识别时，建议参数选择为α＝-25°，β＝25°，当应用到交通违停取证图片的OCR识别时，由于违停车牌倾斜角度往往比较大，参数建议选择为α＝-45°，β＝45°；Step 4.1) Calculate the linear tilt offset detection template with the tilt angle in the range of [α, β]. The two parameters involved here actually represent the tilt angle range of the OCR character, the minimum tilt angle is α, and the maximum tilt angle is β, their selection has an important impact on the noise removal effect and performance, and it should be set as a configurable parameter in the actual application process. When this method is applied to the OCR recognition of bayonet license plate images, the parameter selection can be α= -15°, β=15°, when applied to the OCR recognition of traffic electronic police pictures, the recommended parameter selection is α=-25°, β=25°, when applied to the OCR recognition of traffic violation parking forensics pictures, due to The inclination angle of illegal parking signs is often relatively large, and the parameters are recommended to be α=-45°, β=45°;

步骤4.2)对三值化图像中每一个边界点，利用步骤4.1)中所述的直线检测模版，检查从该边界点出发、边界值相同，并且与模版匹配的连续边界线；Step 4.2) For each boundary point in the three-valued image, use the straight line detection template described in step 4.1) to check the continuous boundary line starting from the boundary point, having the same boundary value and matching the template;

步骤4.3)对匹配的连续边界线，检查其长度是否属于有效范围。如果属于有效范围，则保留，否则该边界线为噪声边界，并将边界线上所有边界点所包含的边界像素值设置为c3，这里所述的连续边界线的长度有效范围，指的是笔划长度的有效范围。Step 4.3) For the matched continuous boundary line, check whether its length belongs to the valid range. If it belongs to the valid range, keep it, otherwise the boundary line is a noise boundary, and set the boundary pixel values contained in all boundary points on the boundary line to c3, the effective range of the length of the continuous boundary line mentioned here refers to the stroke The valid range of lengths.

对前述直线检测模版，其目的是为了加快搜索的速度，使得整个算法满足实际应用中对实时性的要求，直线检测模版有若干个，每一个模版针对应某一个特定的倾斜角，其中所保存信息是倾斜直线上每个y坐标与水平线之间的偏移量，图5是直线检测算法的流程图，具体操作如下：For the aforementioned straight line detection template, its purpose is to speed up the search, so that the entire algorithm meets the real-time requirements in practical applications. There are several straight line detection templates, and each template corresponds to a specific inclination angle. The information is the offset between each y coordinate on the inclined line and the horizontal line. Figure 5 is a flow chart of the line detection algorithm. The specific operations are as follows:

步骤4.1.1)根据所述步骤1)中所设置的最长笔划长度StrokeLen，按照工程实际情况计算直线模版数量Num、直线模版线条的长度Len以及y的最大偏移值MaxOffY。计算公式如下：Step 4.1.1) According to the longest stroke length StrokeLen set in the step 1), calculate the number of straight line templates Num, the length Len of the straight line template lines and the maximum offset value MaxOffY of y according to the actual situation of the project. Calculated as follows:

$\{\begin{matrix} N N u u m m = = 44 S S t t r r o o k k e e L L e e n no \\ L L e e n no = = 22 S S t t r r o o k k e e L L e e n no \\ M m a a x x O o f f f f Y Y = = L L e e n no \end{matrix};;$

步骤4.1.2)对每一个直线倾斜偏移量模版，按如下公式计算y方向的偏移值。Step 4.1.2) For each straight-line tilt offset template, calculate the offset value in the y direction according to the following formula.

三值化图像中直线型边界的搜索是本方法的关键，对每一个差分边界点，从该点出发，用直线检测模版去匹配，检查是否存在连续差分边界像素点构成的广义直线型边界，对广义直线检测，按如下方法：The search for the linear boundary in the three-valued image is the key of this method. For each differential boundary point, starting from this point, use the straight line detection template to match and check whether there is a generalized linear boundary composed of continuous differential boundary pixels. For generalized straight line detection, the method is as follows:

4.2.1)当所述直线的线宽为1的普通直线,该方法从出发点开始，从左到右检查是否有直接相邻的等值边界点,如果存在，则该边界点属于匹配点,否则就属于不匹配点,如图6所示，在普通直线的定义下，该图中有两条边界直线，分别是AB和CD。4.2.1) When the straight line is an ordinary straight line with a line width of 1, the method starts from the starting point and checks from left to right whether there is a directly adjacent equivalent boundary point. If there is, the boundary point belongs to the matching point. Otherwise, it is a non-matching point. As shown in Figure 6, under the definition of ordinary straight lines, there are two boundary straight lines in this figure, namely AB and CD.

4.2.2)当所述直线的线宽为N、基于八连通的广义直线(如图7所示)，该方法从出发点开始，检测八连通范围内的等值差分边界点，如果存在这样的边界点，则该边界点属于匹配点，否则就属于不匹配点。这里所述的八连通的含义如图7所示，如果两个差分边界点满足其中所示的位置关系，则成这两个差分边界点是八连通相邻的，根据该定义，如图8、9所示，其中就给出了两条广义直线的示例，在图8中，广义直线AB从起点A一直延伸到了B，宽度为2，水平长度为13，在图9中，广义直线CD从起点C一直延伸到了D，宽度为3，水平长度为16，按照这种方式定义出来的广义直线对工程中的粘连噪声有很高的匹配度，将这样的广义直线从差分边界图中清除掉，可大大降低噪声、特别是粘连噪声的影响，提高OCR字符定位的精度，在实际工程的应用中，参数N通常小于4。4.2.2) When the line width of the straight line is N, based on an eight-connected generalized straight line (as shown in Figure 7), the method starts from the starting point and detects the equivalent difference boundary points in the eight-connected range, if there is such boundary point, the boundary point is a matching point, otherwise it is a non-matching point. The meaning of the eight-connectedness described here is shown in Figure 7. If two differential boundary points satisfy the positional relationship shown therein, then these two differential boundary points are eight-connected adjacent. According to this definition, as shown in Figure 8 , 9, which gives examples of two generalized straight lines. In Figure 8, the generalized straight line AB extends from the starting point A to B, with a width of 2 and a horizontal length of 13. In Figure 9, the generalized straight line CD Extending from the starting point C to D, the width is 3, and the horizontal length is 16. The generalized straight line defined in this way has a high degree of matching for the adhesion noise in the project. Clear such a generalized straight line from the differential boundary map It can greatly reduce the influence of noise, especially the adhesion noise, and improve the accuracy of OCR character positioning. In practical engineering applications, the parameter N is usually less than 4.

本发明实施例提出一种图片OCR识别系统的字符与背景粘连噪声消除方法，根据工程实际情况，设置好相应的参数后，然后利用均值差分模版计算并得到图像的三值化差分图，最后，在所有差分边界中搜索满足相关条件的广义直线，该直线就是干扰噪声，通过消除该广义直线相关的差分边界点，待识别图片中的噪声信息被大大抑制，并最终使得候选字符区域被准确的定位，从而提高OCR字符识别最终的正确率。The embodiment of the present invention proposes a method for eliminating the adhesion noise between characters and the background of the image OCR recognition system. According to the actual situation of the project, after setting the corresponding parameters, then use the mean value difference template to calculate and obtain the three-valued difference map of the image, and finally, Search for a generalized straight line that satisfies the relevant conditions in all differential boundaries. This straight line is the interference noise. By eliminating the differential boundary points related to the generalized straight line, the noise information in the picture to be recognized is greatly suppressed, and finally the candidate character area is accurately identified. Positioning, thereby improving the final correct rate of OCR character recognition.

图10-19所示，是本发明应用到车牌识别中的效果图，其中给出了5张有粘连的车牌图像，如图10所示，车牌中间的四个字符应用粘连噪声的原因被连接在一起，如果按照传统连通分量进行车牌定位时，这四个字符形成的连通分量就会被认为是一个整体，而应用本发明后，如图11所示，这四个字符被完全分割开了，从而有利于车牌的定位，如图12、14、16、18所述，这些车牌也是类似的效果，也就是经过本发明处理后，如图13、15、17、19所示，都能够有效的将粘连噪声消除，使得其中的字符能够明确的分割处理，从而提高系统的识别率。As shown in Figure 10-19, it is the effect diagram of the application of the present invention in license plate recognition, in which 5 license plate images with adhesion are given, as shown in Figure 10, the four characters in the middle of the license plate are connected due to the application of adhesion noise Together, if the license plate location is performed according to the traditional connected components, the connected components formed by these four characters will be considered as a whole, and after the application of the present invention, as shown in Figure 11, these four characters are completely separated , so as to facilitate the positioning of the license plate, as described in Figures 12, 14, 16, and 18, these license plates also have similar effects, that is, after being processed by the present invention, as shown in Figures 13, 15, 17, and 19, they can all be effective Eliminate the cohesive noise so that the characters in it can be clearly segmented and processed, thereby improving the recognition rate of the system.

本实施例的一种图片OCR识别系统的字符与背景粘连噪声消除方法，针对现有自然场景图片的OCR技术，在噪声处理方面存在的不足，提出了一种新的粘连噪声消除方法，在复杂多变的自然环境中，该方法解决了现有OCR应用系统对具有粘连噪声字符定位困难的问题，实现字符位置的准确提取，并能满足工程应用的实时性要求。The character and background cohesion noise elimination method of a kind of picture OCR recognition system of this embodiment, aim at the OCR technology of existing natural scene picture, the deficiency that exists in the aspect of noise processing, propose a kind of new cohesion noise elimination method, in complex In the changeable natural environment, this method solves the problem that the existing OCR application system has difficulty in locating characters with cohesive noise, realizes accurate extraction of character positions, and can meet the real-time requirements of engineering applications.

本发明所属领域的一般技术人员可以理解，本发明以上实施例仅为本发明的优选实施例之一，为篇幅限制，这里不能逐一列举所有实施方式，任何可以体现本发明权利要求技术方案的实施，都在本发明的保护范围内。Those of ordinary skill in the field of the present invention can understand that the above embodiment of the present invention is only one of the preferred embodiments of the present invention, and is limited by space. All implementation modes cannot be listed here one by one, and any implementation that can embody the technical solution of the claims of the present invention , are all within the protection scope of the present invention.

需要注意的是，以上内容是结合具体的实施方式对本发明所作的进一步详细说明，不能认定本发明的具体实施方式仅限于此，在本发明的上述指导下，本领域技术人员可以在上述实施例的基础上进行各种改进和变形，而这些改进或者变形落在本发明的保护范围内。It should be noted that the above content is a further detailed description of the present invention in conjunction with specific embodiments, and it cannot be determined that the specific embodiments of the present invention are limited thereto. Under the guidance of the present invention, those skilled in the art can Various improvements and modifications are made on the basis of the invention, and these improvements or modifications fall within the protection scope of the present invention.

Claims

1. the character of picture OCR recognition system and a background adhesion noise cancellation method, is characterized in that: character and the background adhesion noise cancellation method of described picture OCR recognition system comprise as follows:

1) priori of the engineering-environment applied according to OCR system, arranges the effective range of the stroke length StrokeLen of character to be identified;

2) choose the image containing character to be identified under physical environment, and with the mode computed image difference boundary graph of mean difference template, described difference boundary graph comprises character zone and background area;

3) the difference boundary graph of image to be identified obtained above is carried out thresholding, form the three-valued boundary graph of image to be identified, described three-valued boundary graph comprises region to be identified and background area;

4) in the three-valued boundary graph of image to be identified obtained above, according in range of tilt angles, linear boundary searches for, check this border whether beyond the effective range of stroke length, when not exceeding the effective range of stroke length, then think effective frontier point, and retain, when exceeding the effective range of stroke length, then this edge determination is noise, it is removed from three value boundary images.

2. the character of a kind of picture OCR recognition system according to claim 1 and background adhesion noise cancellation method, is characterized in that: described step 3) in, by as follows for the formula that the difference boundary graph of described image to be identified carries out thresholding:

T h r e s h o l d (P (x, y)) = \{\begin{matrix} c 1, & i f P (x, y) < P a r a m A \\ c 2, & i f P (x, y) > P a r a m B \\ c 3, & O t h e r s \end{matrix}

Wherein, P (x, y) is the difference value of pixel (x, y) in difference diagram, ParamA and ParamB is the threshold value pre-set, c1, c2 and c3 are signless integers, and span is [0,255], wherein c1 represents border brighter around the brightness ratio of current pixel point, and c2 represents the border that the brightness ratio of current pixel point is darker comparatively around, and c3 represents the value on non-border.

3. the character of a kind of picture OCR recognition system according to claim 2 and background adhesion noise cancellation method, it is characterized in that: described c1 color is black, value is c1=0; Described c2 color is white, value is c2=255, described c3 color is grey, value is c3=128, the value of difference boundary pixel point is respectively 0,255, but not the value of boundary pixel point is 128, the pixel of namely described black and white is boundary pixel point, and the pixel of described grey is non-boundary pixel point.

4. the character of a kind of picture OCR recognition system according to claim 1 and background adhesion noise cancellation method, it is characterized in that: described step 4) in, by the three-valued image of described image to be identified according in range of tilt angles, the linear boundary algorithm steps that carries out searching for is as follows:

4.1) calculate the straight line declining displacement of angle of inclination in [α, β] scope and detect masterplate;

4.2) to each frontier point in three-valued image, the straight-line detection masterplate obtained in utilizing step a), checks from this frontier point, boundary value identical, and with the continuum boundary line of stencil matching;

4.3) to the continuum boundary line of coupling, checking whether its length belongs to effective range, when belonging to effective range, then retaining, otherwise this boundary line is noise margin, and the border pixel values that frontier points all on boundary line comprise is set to c3.

5. the character of a kind of picture OCR recognition system according to claim 3 and background adhesion noise cancellation method, it is characterized in that: described step 4-1) in, the algorithm steps of the straight-line detection masterplate of described calculating angle of inclination in [α, β] scope is as follows:

4.1.1) according to described step 1) in set the longest stroke length StrokeLen, according to engineering practice calculated line masterplate quantity Num, the length Len of straight line masterplate lines and the maximum deviation value MaxOffY of y.Computing formula is as follows:

\{\begin{matrix} N u m = 4 S t r o k e L e n \\ L e n = 2 S t r o k e L e n \\ M a x O f f Y = L e n \end{matrix};

4.1.2) to each straight line declining displacement masterplate, the off-set value in y direction is calculated as follows:

y = \frac{x * o f f Y}{L e n}, x [0, L e n], O f f Y &Element; [- M a x O f f Y, M a x O f f Y] .

6. the character of a kind of picture OCR recognition system according to claim 3 and background adhesion noise cancellation method, it is characterized in that: described step 4.2) in the three-valued image of image to be identified carried out the searching algorithm of linear boundary, judge from certain frontier point, boundary value identical, and as follows with the method for the continuum boundary line of straight line declining displacement stencil matching:

4.2.1) when the live width of described straight line is the generic linear of 1, the method, from starting point, from left to right checks whether the equivalent boundary of direct neighbor, works as existence, then on this straight line, all frontier points all belong to match point, otherwise just belong to not match point;

4.2.2) when described straight line live width is N, based on the general line of eight connectivity, the method is from starting point, detect the equivalent boundary under eight connectivity meaning, when there is such general line, then on this general line, all frontier points all belong to match point, otherwise just belong to not match point.